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<B) TELEFAX: (415) 576-0300 
(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3279 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc "DNA" 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:l: 





AAAACCCCAA 


AACCCCAAAA 


CCCCTTTTAG 


AGCCCTGCAG 


TTGGAAATAT 


AACCTCAGTA 


60 




TTAATAAGCT 


CAGATTTTAA 


ATATTAATTA 


CAAAACCTAA 


ATGGAGGTTG 


ATGTTGATAA 


120 




TCAAGCTGAT 


AATCATGGCA 


TTCACTCAGC 


TCTTAAGACT 


TGTGAAGAAA 


TTAAAGAAGC 


180 




TAAAACGTTG 


TACTCTTGGA 


TCCAGAAAGT 


TATTAGATGA 


AGAAATCAAT 


CTCAAAGTCA 


240 




TTATAAAGAT 


TTAGAAGATA 


TTAAAATATT 


TGCGCAGACA 


AATATTGTTG 


CTACTCCACG 


300 


AGACTATAAT 


GAAGAAGATT 


TTAAAGTTAT 


TGCAAGAAAA 


GAAGTATTTT 


CAACTGGACT 


360 


ui 


AATGATCGAA 


CTTATTGACA 


AATGCTTAGT 


TGAACTTCTT 


TCATCAAGCG 


ATGTTTCAGA 


420 


TAGACAAAAA 


CTTCAATGAT 


TTGGATTTCA 


ACTTAAGGGA 


AATCAATTAG 


CAAAGACCCA 


480 


m 

}* 


TTTATTAACA 


GCTCTTTCAA 


CTCAAAAGCA 


GTATTTCTTT 


CAAGACGAAT 


GGAACCAAGT 


540 




TAGAGCAATG 


ATTGGAAATG 


AGCTCTTCCG 


ACATCTCTAC 


ACTAAATATT 


TAATATTCCA 


600 




GCGAACTTCT 


GAAGGAACTC 


TTGTTCAATT 


TTGCGGGAAT 


AACGTTTTTG 


ATCATTTGAA 


660 


s :i 


AGTCAACGAT 


AAGTTTGACA 


AAAAGCAAAA 


AGGTGGAGCA 


GCAGACATGA 


ATGAACCTCG 


720 


03 


ATGTTGATCA 


ACCTGCAAAT 


ACAATGTCAA 




w/i X w*V>-» 1 1 XL 




/oU 




CAACGTGCCG 


AATTGGAATA 


ATATGAAATC 


AAGAACCAGA 


ATATTTTATT 


GCACTCATTT 


840 


|M 


TAATAGAAAT 


AACCAATTCT 


TCAAAAAGCA 


TGAGTTTGTG 


AGTAACAAAA 


ACAATATTTC 


900 




AGCGATGGAC 


AGAGCTCAGA 


CGATATTCAC 


GAATATATTC 


AGATTTAATA 


GAATTAGAAA 


960 




GAAGCTAAAA 


GATAAGGTTA 


TCGAAAAAAT 


TGCCTACATG 


CTTGAGAAAG 


TCAAAGATTT 


1020 




TAACTTCAAC 


TACTATTTAA 


CAAAATCTTG 


TCCTCTTCCA 


GAAAATTGGC 


GGGAACGGAA 


1080 




ACAAAAAATC 


GAAAACTTGA 


TAAATAAAAC 


TAGAGAAGAA 


AAGTCGAAGT 


ACTATGAAGA 


1140 




GCTGTTTAGC 


TACACAACTG 


ATAATAAATG 


CGTCACACAA 


TTTATTAATG 


AATTTTTCTA 


1200 




CAATATACTC 


CCCAAAGACT 


TTTTGACTGG 


AAGAAACCGT 


AAGAATTTTC 


AAAAGAAAGT 


1260 




TAAGAAATAT 


GTGGAACTAA 


ACAAGCATGA 


ACTCATTCAC 


AAAAACTTAT 


TGCTTGAGAA 


1320 




GATCAATACA 


AGAGAAATAT 


CATGGATGCA 


GGTTGAGACC 


TCTGCAAAGC 


ATTTTTATTA 


1380 




TTTTGATCAC 


GAAAACATCT 


ACGTCTTATG 


GAAATTGCTC 


CGATGGATAT 


TCGAGGATCT 


1440 
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CGTPGTPTPG 


p to a *r t arc a hp 

luni x AvjrAx 


bAi ill iblA 


1 TCACCGAG 


/^TV TV ^"ITV ^1 TV TV TV TV 

CAACAGAAAA 


GTTACTCCAA 


1500 




AJVPPTATTAP 


m apap.aaap.a 

1 riUibAAAuA 


AiAl i HjUvjA 


CGTCATTATG 


TV TV TV IkfflMmMH It 

AAAATGTCAA 


TCGCAGACTT 


1560 




AAAGAACJC5AA 


"Lbt X lub lb 


Abjbi 1 CLAAbjA 


TV A A A /"T TV /^/"ttiifn 

AAAAGAGGTT 


r^iTV tv /1 tv tv m s*i y« -^i 

GAAGAATGGA 


AAAAGTCGCT 


1620 




J. XXX uLn 


pp m np.a a a a r» 

v-C 1 VaGAAAAC 


T 1 ^ A 7\ OTA AT 
i CAbxAC 1 AA 1 


A /T/rri TV A /"I TV TV TV 

ACCGAAGAAA 


ACTACTTTCC 


GTCCAATTAT 


1680 




H. 21 PTTTP a a T 
unL 111 Liuil 


a A07\ TVPTiTTP 

AAG AAG A i i G 


TAAATTCAGA 


r-1/~1/-t 71 TV /^TV /'II n 

CCGGAAGACT 


ACAAAATTAA 


CTACAAATAC 


1740 




C5A ZXHTTATTn 


AAb 1 C i LAL I 


iAAlvaCI IAA 


GACATTGAAG 


AATAGAATGT 


TTAAAGATCC 


1800 




x x x x uun x X ^» 


bbibi. iii 1A 


AC lAibiAlbjA 


fTl/~irtT7V TVm^lTV TV TV 

TGTAATGAAA 


AAGTATGAGG 


AGTTTGTTTG 


1860 






r*a a r*TTw*« a r» 
CAAGi iGGAC 


TV A OO A TV TV TV nni 

AACCAaaAcT 


CTTCTTTGCA 


ACTATGGATA 


TCGAAAAGTG 


1920 




A 1 A 1 uA 1 Ab* 1 


OTTA A A OTV /I A O 

GTAAACAGAG 


TV TV TV TV TV /"irriTV mn 

AAAAACTATC 


AACATTCCTA 


AAAACTACTA 


AATTACTTTC 


1980 




x 1 V.AuAi lit 


iGGAi lAiGA 


nHI/l/^Tt /TT\ TV TV m 

CTGCACAAAT 


TCTAAAGAGA 


AAGAATAACA 


TAGTTATCGA 


2040 




I 1 CLtAAAAAC 


TTTAGAAAGA 


AAGAAATGAA 


AGATTATTTT 


AGACAGAAAT 


TCCAGAAGAT 


2100 




1 bxCAC 1 1 bM 


(~*/~* A OO TV /"» TV 7V m 

GGAGGACAAT 


ATCCAACCTT 


ATTCAGTGTT 


CTTGAAAATG 


AACAAAATGA 


2160 


i 


CI lAAAlbiCA 


A A /t TV A TV TV /"*TV ft! 

aAGaAAACAT 


TAATTGTTGA 


AGCAAAGCAA 


AGAAATTATT 


TTAAGAAAGA 


2220 


D 

a 


fP7V A /'li 1 if ■ 17V /"H 1 ill I 

1 AAC 1 1 AC J/T 


^TTV TV rt/s» ^im^iit 

CAACCAGTCA 


TTAATATTTG 


CCAATATAAT 


TACATTAACT 


TTAATGGGAA 


2280 


iii 


'T'l 1 1 • A *P TV A 7V 

. bllllAlAAA 


OTV A A /TTV Tl TV TV /""I 

CaAACAAAAG 


tv tv mm /'im « m 

GAATTCCTCA 


AGGTCTTTGA 


GTTTCATCAA 


TTTTGTCATC 


2340 


si« 
m 


Al i 1 1AI I Ai 


GCAACATTAG 


TV /^^ITV TV TV ^l^im^*! 

AGGAAAGCTC 


CTTAGGATTC 


CTTAGAGATG 


AATCAATGAA 


2400 


La 

U 


Of /"""TV* A A A A rn 

LLL I VaAAAAT 


^i/"Itv tv tv m/inwn^ 

CCAAATGTTA 


ATCTTCTAAT 


GAGACTTACA 


GATGACTATC 


TTTTGATTAC 


2460 


?■ 


TV TV /"•TV'TV 7V O 7V r* 

AAC I CAAGAG 


TV TV TITV TV m/T /T TV /~1 

aataatgcag 


TATTGTTTAT 


TGAGAAACTT 


ATAAACGTAA 


GTCGTGAAAA 


2520 


0 

H 


iGGAl i I AAA 


fflmnTi tv rrnv m^tTv 

TTCAATATGA 


AGAAACTACA 


GACTAGTTTT 


CCATTAAGTC 


CAAGCAAATT 


2580 


; . t 


'POOa A TV TV i"PTV 

1 GCAAAA JL AC 


/r /"« TV TV n-tfi/t tv mTV 

GGAATGGATA 


GTGTTGAGGA 


yvirt« «n k a m« 

GCAAAATATT 


GTTCAAGATT 


ACTGCGATTG 


2640 




vjAi IGGCAi C 


nr/"*A T\ 1 1 ii i \/~* i\ mTv 

TCAATTGATA 


m^*1 TV TV TV Tl /tm/'llll 

TGAAAACTCT 


TGCTTTAATG 


CCAAATATTA 


ACTTGAGAAT 


2700 




A OA an/"* A A mrp 
AVjAAVjVxAAx x 


C iG i G iACAC 


HTOTV TV rTT/~lfT»T\ TV TV 

TCaATCTAAA 


CATGCAAACA 


AAGAAAGCAT 


CAATGTGGCT 


2760 




PaaOAaOAATV 
CAAGAAGAAA 


/^trp A TV A/TTVTOm 

CTAAaGTCGT 


II II 1 Ii III ii tv tv mmi n 

TTTTAATGAA 


TAACATTACC 


CATTATTTTA 


GAAAGACGAT 


2820 




m a p a a ooo a a 


/"I A OTWT"TVI f1f*t TV 

GAC x TTGCGA 


ATAAAACTCT 


CAACAAGTTA 


TTTATATCAG 


GCGGTTACAA 


2880 




7A T 1 7A r* A TP O TV TV 

Ai ACA1GCAA 


TV* AO /TO TV TV TV /T 

TGAGCCAAAG 


TV TV rnTV /""I TV TV /T^l Tl 

AATACAAGGA 


CCACTTTAAG 


AAGAACTTAG 


CTATGAGCAG 


2940 




T a TCI 7A Tr*c* a o 


TTAGAGGTAT 


CTAAAATTAT 


ATACTCTGTA 


ACCAGAGCAT 


TCTTTAAATA 


3000 




cci lbmmc 


71 A HTA nriTTTV TV /'*1/*t 

AATATTAAGG 


ATACAATTTT 


TGGAGAGGAG 


CATTATCCAG 


ACTTTTTCCT 


3060 




TAGCACACTG 


AAGCACTTTA 


TTGAAATATT 


CAGCACAAAA 


AAGTACATTT 


TCAACAGAGT 


3120 




TTGCATGATC 


CTCAAGGCAA 


AAGAAGCAAA 


GCTAAAAAGT 


GACCAATGTC 


AATCTCTAAT 


3180 




TCAATATGAT 


GCATAGTCGA 


CTATTCTAAC 


TTATTTTGGA 


AAGTTAATTT 


TCAATTTTTG 


3240 




TCTTATATAC 


TGGGGTTTTG 


GGGTTTTGGG 


GTTTTGGGG 






3279 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) • LENGTH : 1031 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
<D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Val Asp Val Asp Asn Gin Ala Asp Asn His Gly He His Ser 
1 5 10 * 15 

Ala Leu Lys Thr Cys Glu Glu He Lys Glu Ala Lys Thr Leu Tyr Ser 
20 25 30 

Trp He Gin Lys Val He Arg Cys Arg Asn Gin Ser Gin Ser His Tyr 
35 40 45 

Lys Asp Leu Glu Asp He Lys He Phe Ala Gin Thr Asn He Val Ala 
50 55 60 

Thr Pro Arg Asp Tyr Asn Glu Glu Asp Phe Lys Val He Ala Arg Lys 
65 70 75 80 

Glu Val Phe Ser Thr Gly Leu Met He Glu Leu He Asp Lys Cys Leu 

Uj 85 90 95 

m Val Glu Leu Leu Ser Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin 

?! 100 105 110 

f* . 

H Cys Phe Gly Phe Gin Leu Lys Gly Asn Gin Leu Ala Lys Thr His Leu 

F 115 120 125 

CI 

r: Leu Thr Ala Leu Ser Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp 

130 135 ' 140 

'0 Asn Gin Val Arg Ala Met He Gly Asn Glu Leu Phe Arg His Leu Tyr 

*S ■ 145 150 155 ^ 160 

]U Thr Lys Tyr Leu He Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin 

165 170 175 

Phe Cys Gly Asn Asn Val Phe Asp His Leu Lys Val Asn Asp Lys Phe 
180 185 190 

Asp Lys Lys Gin Lys Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys 
195 200 205 

Cys Ser Thr Cys Lys Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu 
210 215 220 

Asn Asn He Asn Val Pro Asn Trp Asn Asn Met Lys Ser Arg Thr Arg 
225 230 235 ~ 240 

He Phe Tyr Cys Thr His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys 
245 250 255 

His Glu Phe Val Ser Asn Lys Asn Asn He Ser Ala Met Asp Arg Ala 
260 265 270 
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Gin Thr He Phe Thr Asn He Phe Arg Phe Asn Arg He Arg Lys Lys 
275 280 285 

Leu Lys Asp Lys Val He Glu Lys He Ala Tyr Met Leu Glu Lys Val 
290 295 300 

Lys Asp Phe Asn Phe Asn Tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro 
305 310 315 320 

Glu Asn Trp Arg Glu Arg Lys Gin Lys He Glu Asn Leu He Asn Lys 
325 330 335 

Thr Arg Glu Glu Lys Ser Lys Tyr Tyr Glu Glu Leu Phe Ser Tyr Thr 
340 345 350 

Thr Asp Asn Lys Cys Val Thr Gin Phe He Asn Glu Phe Phe Tyr Asn 
355 360 365 

He Leu Pro Lys Asp Phe Leu Thr Gly Arg Asn Arg Lys Asn Phe Gin 
370 375 ^ 380 

Lys Lys Val Lys Lys Tyr Val Glu Leu Asn Lys His Glu Leu He His 
385 390 395 400 

Lys Asn Leu Leu Leu Glu Lys He Asn Thr Arg Glu He Ser Trp Met 
405 410 ~ 415 

Gin Val Glu Thr Ser Ala Lys His Phe Tyr Tyr Phe Asp His Glu Asn 
420 425 430 

He Tyr Val Leu Trp Lys Leu Leu Arg Trp He Phe Glu Asp Leu Val 
435 440 445 

Val Ser Leu He Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin Lys Ser 
450 455 460 

Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys Asn He Trp Asp Val He Met 
465 470 475 480 

Lys Met Ser He Ala Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin 
485 490 495 

Glu Lys Glu Val Glu Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly 
500 505 510 

Lys Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arg Pro He Met Thr 
515 520 525 

Phe Asn Lys Lys He Val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr 
530 535 * 540 

Thr Asn Thr Lys Leu Leu Asn Ser His Leu Met Leu Lys Thr Leu Lvs 
545 550 555 560 

Asn Arg Met Phe Lys Asp Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp 
565 570 575 

Asp Val Met Lys Lys Tyr Glu Glu Phe Val Cys Lys Trp Lys Gin Val 
580 585 590 

Gly Gin Pro Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr 
595 600 605 



Asp Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys 
610 615 620 

Leu Leu Ser Ser Asp Phe Trp He Met Thr Ala Gin He Leu Lys Arg 
625 630 635 640 

Lys Asn Asn He Val He Asp Ser Lys Asn Phe Arg Lys Lys Glu Met 
645 650 ~ 655 

Lys Asp Tyr Phe Arg Gin Lys Phe Gin Lys He Ala Leu Glu Gly Gly 
660 665 670 

Gin Tyr Pro Thr Leu Phe Ser Val Leu Glu Asn Glu Gin Asn Asp Leu 
675 680 685 

Asn Ala Lys Lys Thr Leu He Val Glu Ala Lys Gin Arg Asn Tyr Phe 
690 695 700 

Lys Lys Asp Asn Leu Leu Gin Pro Val He Asn He Cys Gin Tyr Asn 
705 710 715 " 720 

Tyr He Asn Phe Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly He Pro 
725 730 ' 735 

Gin Gly Leu Cys Val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr 
740 745 750 

Leu Glu Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro 
755 760 765 

Glu Asn Pro Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu 
770 775 780 

Leu He Thr Thr Gin Glu Asn Asn Ala Val Leu Phe He Glu Lys Leu 
785 790 795 800 

He Asn Val Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu 
805 810 815 

Gin Thr Ser Phe Pro Leu Ser Pro Ser Lys Phe Ala Lys Tyr Gly Met 
820 825 * 830 

Asp Ser Val Glu Glu Gin Asn He Val Gin Asp Tyr Cys Asp Trp He 
835 840 845 

Gly He Ser He Asp Met Lys Thr Leu Ala Leu Met Pro Asn He Asn 
850 855 860 

Leu Arg He Glu Gly He Leu Cys Thr Leu Asn Leu Asn Met Gin Thr 
865 870 875 880 

Lys Lys Ala Ser Met Trp Leu Lys Lys Lys Leu Lys Ser Phe Leu Met 
885 890 895 

Asn Asn He Thr His Tyr Phe Arg Lys Thr He Thr Thr Glu Asp Phe 
900 90S 910 

Ala Asn Lys Thr Leu Asn Lys Leu Phe He Ser Gly Gly Tyr Lys Tyr 
915 920 925 

Met Gin Cys Ala Lys Glu Tyr Lys Asp His Phe Lys Lys Asn Leu Ala 
930 935 940 



Met Ser Ser Met He Asp Leu Glu Val Ser Lys He He Tyr Ser Val 
945 950 955 960 

Thr Arg Ala Phe Phe Lys Tyr Leu Val Cys Asn He Lys Asp Thr He 
965 970 ~ 975 

Phe Gly Glu Glu His Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His 
980 985 990 

Phe He Glu He Phe Ser Thr Lys Lys Tyr He Phe Asn Arg Val Cys 
995 1000 1005 

Met He Leu Lys Ala Lys Glu Ala Lys Leu Lys Ser Asp Gin Cys Gin 
1010 1015 1020 

Ser Leu He Gin Tyr Asp Ala 
1025 ~ 1030 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
Ja (D) TOPOLOGY: linear 



m 



o 



(ii) MOLECULE TYPE: other nucleic acid 



P (A) DESCRIPTION: /desc m "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG GTAGTTTAGA 60 

U AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA TTTGGATGAT ATAGAAAATT 120 

p TACTTCCTAA TACATTCAAC AAGTATAGCA GCTCTTGTAG TGACAAGAAA GGATGCAAAA 180 

CATTGAAATC TGGCTCGAAA TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG 240 

AGTTCTACTT CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 300 

AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA AATCAGGTAA 360 

TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA TTATGGAGAA AATTACTTAA 420 

TACTAAAAGG TAAACAGTTT GGATTATTTC CCTAGCCAAC AATGATGAGT ATATTAAATT 480 

CATATGAGAA TGAGTCAAAG GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT 540 

AAAACGCAAG AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 600 

TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC TTGAGACAAT 660 

TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA AGTTCTGATG TGTATGCCAT 720 

TATTTTGTGA ATTAATCTCA AATATCTTAT CTCAATTTAA TGGATAGCTA TAGAAACAAA 780 

CCAAATAAAC CATGCAAGTT TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC 840 

TGAATTTATA TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 900 

TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT AAAAGAAGCA 960 



.Hauls BajLHH iuimiiuj. uwiim Mm 





AAAG AAG i 


TV 7\ TV V TV TV TV VI TV /"I 

AAAGAAAGAG 


ATTTCAAAAT TTGTTGATTC TTCTGTAACC 


1020 


GGA ATT7A &P A 




T»TVO/^TV 7V /*l/*"<TV TV 

TAGCAACGAA 


AAAGAAGAAG AGCTATCACA ATCCTGATTC 


1080 


TT 1 A A Af2 AT"!"! 1 


LAAAAfll lV_u 


TV, np T 1 7\ 7\ P 1 7\ P* 7\ 

AGG I AAG AG A 


GATACATTCA TTAAAATTCA TATATTATAG 


1140 


illi iwil 1 X 


HTV/^TV /^/^rp/^rprp 

UALAGCTGTT 


ATTTTCTTTT 


ATCTTAACAA TATTTTTTGA TTAGCTGGAA 


1200 


GlAAAAAGiA 


rn^n tv t\ mil t* -m 

TCAAATAAGA 


GAAGCGCTAG 


ACTGAGGTAA CTTAGCTTAT TCACATTCAT 


1260 




rp/~»7» err tv mil rn/"*i/^ 

TCATATATCC 


TV TV m TV /""t/"1 T* rTTj^tTl 

AATACGATGA 


TAAGGAAACA GCAGTCATCC GTTTTAAAAA 


1320 


1AG1GV~1A.LG 


AGGACTAAAT 


TTTTAGAGTC 


AAGAAATGGA GCCGAAATCT TAATCAAAAA 


1380 


uAAl lGCGav*. 


GATATTGCAA 


AAGAATCGAA 


CTCTAAATCT TTCGTTAATA AGTATTACCA 


1440 


ATCTTGATT G 


ATTGAAGAGA 


TTGACGAGGC 


AACTGCACAG AAGATCATTA AAGAAATAAA 


1500 


GTAACTTTTA 


TTAATTAGAG 


AATAAACTAA 


ATTACTAATA TAGAGATCAG CGATCTTCAA 


1560 


TTGACGAAAT 


AAAAGCTGAA 


CTAAAGTTAG 


ACAATAAAAA ATACAAACCT TGGTCAAAAT 


1620 


ATTGAGGAAG 


GAAAAGAAGA 


CCAGTTAGCA 


AAAGAAAAAA TAAGGCAATA AATAAAATGA 


1680 


GTACAGAAGT 


GAAGAAATAA 


AAGATTTATT 


TTTTTCAATA ATTTATTGAA AAGAGGGGTT 


1740 


TTGGGGTTTT 


GGGGTTTTGG 


GG 




1762 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Pro Gin Asn Pro Lys Thr Pro Lys Pro Leu Lys Lys Lys Lys Leu Arg 
1 5 10 ' 15 

Phe Arg Asn Lys lie Leu Phe Pro His Lys Trp Arg Trp lie Leu lie 
20 25 " 30 

Trp Met He Lys He Tyr Phe Leu He His Ser Thr Ser He Ala Ala 
35 40 45 

Leu Val Val Thr Arg Lys Asp Ala Lys His Cys Asn Leu Ala Arg Asn 
50 55 60 

Arg Leu His Cys Leu Phe Gin Ser Cys Lys Asn Asn Ser Ser Thr Ser 
65 70 75 80 

Arg Met Gin He Phe He Thr He Leu Ser Cys Glu Asn Phe Lys Ala 
85 90 95 

Glu Ser Lys Glu Lys Leu Lys His Tyr Cys Leu Asn Lys He Arg Cys 
100 105 110 

Gly Leu Phe Tyr Phe Leu Asp His Phe Leu Arg Ser He Met Glu Lys 
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115 



120 



125 



He Thr Tyr Lys Val Asn Ser Leu Asp Tyr Phe Pro Ser Gin Gin Cys 
130 135 140 

Cys Val Tyr He His Met Arg Met Ser Gin Arg He Ser He His Gin 
145 150 155 160 

Thr Tyr Gin Arg Gin Thr Arg Tyr Lys Thr Gin Glu Lys Val Cys Ser 
165 170 " 175 

Asn Ser Arg Arg Thr Tyr Cys He Tyr Tyr Ser Tyr Gly Phe Tyr Tyr 
180 185 " 190 

Asn Cys Phe Arg Tyr Arg Arg Cys Thr Pro Glu Ser Cys Asp Asn Cys 
195 200 205 

Lys Ser Cys Leu Gin Leu Lys Glu Ser Gin Phe Cys Lys Phe Cys Val 
210 215 220 

Cys His Tyr Phe Val Asn Ser Gin He Ser Tyr Leu Asn Leu Met Asp 
225 230 235 240 

Ser Tyr Arg Asn Lys Pro Asn Lys Pro Cys Lys Phe Asn Gly He Tyr 
245 250 255 

Val Lys Ser Phe Gly Thr Asn Ala His Cys He Tyr He Gly Phe Leu 
260 265 * 270 

Lys His Arg Tyr Thr Glu Cys Phe Arg Asp Cys Phe Ser Leu Gin Gin 
275 280 285 

He Thr Cys Phe Asp Tyr Ser Cys Ser Ser Leu He Ser Leu Lys Glu 
290 295 300 

Ala Gly Glu Met Lys Arg Arg Leu Lys Lys Glu He Ser Lys Phe Val 
305 310 315 320 

Asp Ser Ser Val Thr Gly lie Asn Asn Lys Asn He Ser Asn Glu Lys 
325 330 335 

Glu Glu Glu Leu Ser Gin Ser Cys Phe Leu Lys He Ser Lys He Pro 
340 345 350 

Gly Lys Arg Asp Thr Phe He Lys He His He Leu Phe Phe He Ser 
355 360 365 

Gin Leu Leu Phe Ser Phe He Leu Thr He Phe Phe Asp Leu Glu Val 
370 375 380 

Lys Ser He Lys Glu Lys Arg Thr Glu Val Thr Leu He His He His 
385 390 395 400 

Arg Ser Thr Phe He Tyr Pro He Arg Cys Gly Asn Ser Ser His Pro 
405 410 415 

Phe Lys Cys Tyr Glu Asp He Phe Arg Val Lys Lys Trp Ser Arg Asn 
420 425 430 

Leu Asn Gin Lys Glu Leu Arg Arg Tyr Cys Lys Arg He Glu Leu He 
435 440 ^ 445 
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Phe Arg Val Leu Pro He Leu He Asp Cys Arg Asp Arg Gly Asn Cys 
450 455 460 

Thr Glu Asp His Arg Asn Lys Val Thr Phe He Asn Arg He Asn He 
465 470 475 480 

Thr Asn He Glu He Ser Asp Leu Gin Leu Thr Lys Lys Leu Asn Ser 
485 490 495 

Thr He Lys Asn Thr Asn Leu Gly Gin Asn He Glu Glu Gly Lys Glu 
500 505 510 

Asp Gin Leu Ala Lys Glu Lys He Arg Gin He Lys Cys Val Gin Lys 
515 520 525 

Cys Arg Asn Lys Arg Phe lie Phe Phe Asn Asn Leu Leu Lys Arg Gly 
530 " " 535 540 

Val Leu Gly Phe Trp Gly Phe Gly 
545 550 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 562 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



01 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

hi 

! * Pro Lys Thr Pro Lys Pro Gin Asn Pro Tyr Lys Lys Arg Lys Asn Cys 

P* 15 10 15 



Gly Ser Leu Glu He Lys Tyr Tyr Ser Arg Thr Asn Gly Asp Gly Tyr 

U " * 20 25 30 



Cys Phe Gly Cys Tyr Arg Lys Phe Thr Ser Tyr He Gin Gin Val Gin 
35 * 40 45 

Leu Leu Gin Glu Arg Met Gin Asn He Glu He Trp Leu Glu He Ala 
50 ~ 55 60 

Phe He Asp Tyr Ser Lys Val Ala Lys Thr He Arg Val Leu Leu Leu 
65 70 75 80 

Gly Cys Lys Ser Leu Arg Phe Phe Leu Glu Lys He Ser Phe Lys Lys 
85 90 95 

Arg Arg Ala Lys Ser Arg Asn Cys Asn He Thr Asn Val He Lys Ser 
100 105 110 

Gly Asn Glu Asp Tyr Ser He Phe He Thr Ser Gly Ala Leu Trp Arg 
115 120 125 

Lys Leu Leu Asn Thr Lys Arg Thr Val Trp He He Ser Leu Ala Asn 
130 135 140 

Asn Asp Glu Tyr He Lys Phe He Cys Glu Cys Val Lys Gly Ser Arg 
145 150 155 160 
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Tyr lie Arg Leu Thr Lys Asp Lys Leu Ala lie Lys Arg Lys Lys Lys 
165 170 ~ 175 

Phe Asp Asn Arg Thr Ala Glu Glu Leu He Ala Phe Thr He Arg Met 
180 185 190 

Gly Phe He Thr He Val Leu Gly He Asp Gly Glu Leu Pro Ser Leu 
195 200 205 

Glu Thr He Glu Lys Ala Val Tyr Asn Cys Arg Asn Arg Ser Ser Glu 
210 215 220 



Ser Ser Asp Val Tyr Ala He He Leu Cys He Asn Leu 
225 230 235 

He Ser He Trp He Ala He Glu Thr Asn Gin He Asn 
245 250 

Leu Met Glu Tyr Thr Leu Asn Pro Leu Gly Gin Met His 
260 265 

He Leu Asp Ser Ser He Asp Thr Gin Asn Ala Leu Glu 
275 280 285 

Ala Tyr Asn Arg Leu Pro Val Leu He Thr Leu Ala His 
290 295 300 

Leu Lys Lys Gin Ala Lys Cys Lys Glu Asp Arg Lys Arg 
305 310 315 

Leu Leu He Leu Leu Pro Glu Leu Thr Thr Arg He Leu 
325 330 

Lys Lys Lys Ser Tyr His Asn Pro Asp Ser Arg Phe Gin 
340 345 

Val Arg Glu He His Ser Leu Lys Phe He Tyr Tyr Ser 
355 360 365 

His Ser Cys Tyr Phe Leu Leu Ser Gin Tyr Phe Leu He 
370 375 ~ 380 

Lys Val Ser Asn Lys Arg Ser Ala Arg Leu Arg Leu Ser 
385 390 ~ 395 

Phe He Asp Arg Pro Ser Tyr He Gin Tyr Asp Asp Lys 
405 410 

Val He Arg Phe Lys Asn Ser Ala Met Arg Thr Lys Phe 
420 425 

Arg Asn Gly Ala Glu He Leu He Lys Lys Asn Cys Val 
435 440 * 445 

Lys Glu Ser Asn Ser Lys Ser Phe Val Asn Lys Tyr Tyr 
450 455 460 

Leu He Glu Glu He Asp Glu Ala Thr Ala Gin Lys He 
465 470 475 

He Lys Leu Leu Leu He Arg. Glu Thr Lys Leu Leu He 
485 490 



Lys Tyr Leu 
240 

His Ala Ser 
255 

Thr Glu Phe 
270 

Thr Asp Leu 

Leu Leu Tyr 

Phe Gin Asn 
320 

Ala Thr Lys 
335 

Lys Phe Gin 
350 

Phe Ser Phe 

Ser Trp Lys 

Leu Phe Thr 
400 

Glu Thr Ala 
415 

Leu Glu Ser 
430 

Asp He Ala 
Gin Ser Cys 



He Lys Glu 
480 

Arg Ser Ala 
495 
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lie Phe Asn Cys Arg Asn Lys Ser Cys Thr Lys Val Arg Gin Lys lie 
500 505 510 

Gin Thr Leu Val Lys lie Leu Arg Lys Glu Lys Lys Thr Ser Gin Lys 
515 520 525 

Lys Lys Gly Asn Lys Asn Glu Tyr Arg Ser Glu Glu lie Lys Asp Leu 
530 535 540 

Phe Phe Ser He He Tyr Cys Lys Glu Gly Phe Trp Gly Phe Gly Val 
545 550 " ' 555 ~ ^ 560 

Leu Gly 

INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Pro Lys Pro Gin Asn Pro Lys Thr Pro He Lys Lys Glu Lys He Glu 
1 5 10 " 15 

Val Val Lys Asn He He Pro Ala Gin Met Glu Met Asp He Asp Leu 
20 25 30 

Asp Asp He Glu Asn Leu Leu Pro Asn Thr Phe Asn Lys Tyr Ser Ser 
35 40 45 

Ser Cys Ser Asp Lys Lys Gly Cys Lys Thr Leu Lys Ser Gly Ser Lys 
50 55 60 

Ser Pro Ser Leu Thr He Pro Lys Leu Gin Lys Gin Leu Glu Phe Tyr 
65 70 75 80 

Phe Ser Asp Ala Asn Leu Tyr Asn Asp Ser Phe Leu Arg Lys Leu Val 
85 90 95 

Leu Lys Ser Gly Glu Gin Arg Val Glu He Glu Thr Leu Leu Met Phe 
100 105' 110 

Lys Asn Gin Val Met Arg He He Leu Phe Phe Arg Ser Leu Leu Lys 
115 120 125 

Glu His Tyr Gly Glu Asn Tyr Leu He Leu Lys Gly Lys Gin Phe Gly 
130 135 140 

Leu Phe Pro Pro Thr Met Met Ser He Leu Asn Ser Tyr Glu Asn Glu 
145 150 155 * 160 

Ser Lys Asp Leu Asp Thr Ser Asp Leu Pro Lys Thr Asn Ser Leu Asn 
165 170 175 

Ala Arg Lys Ser Leu He He Glu Gin Gin Lys Asn Leu Leu His Leu 
180 185 190 
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Leu Phe Val Trp Val Leu Leu Gin Leu Phe Val Ser Thr Val Asn Ser 
195 200 205 

Arg Val Leu Arg Gin Leu Lys Lys Leu Phe Thr Thr Glu Gly He Ala 
210 215 220 

Val Leu Lys Val Leu Met Cys Met Pro Leu Phe Cys Glu Leu He Ser 
225 230 235 240 

Asn He Leu Ser Gin Phe Asn Gly Leu Lys Gin Thr Lys Thr Met Gin 
245 250 255 

Val Trp Asn He Arg He Leu Trp Asp Lys Cys Thr Leu Asn Leu Tyr 
260 265 270 

Trp He Leu Lys Ala He His Arg Met Leu Arg Leu He Leu Thr Thr 
275 280 285 

Asp Tyr Leu Phe Cys Leu Leu Leu Leu He Ser Tyr He Phe Lys Arg 
290 295 300 

Ser Arg Arg Asn Glu Lys Lys Thr Lys Glu Arg Asp Phe Lys He Cys 
305 310 315 " 320 

Cys Phe Phe Cys Asn Arg Asn Gin Gin Glu Tyr Gin Arg Lys Arg Arg 
325 330 335 

Arg Ala He Thr He Leu He Leu Lys Asp Phe Lys Asn Ser Arg Glu 
340 345 ^ 350 

Arg Tyr He His Asn Ser Tyr He He Val Phe His Phe Thr Ala Val 
355 360 365 

He Phe Phe Tyr Leu Asn Asn He Phe Cys Leu Ala Gly Ser Lys Lys 
370 375 380 

Tyr Gin He Arg Glu Ala Leu Asp Cys Gly Asn Leu Ala Tyr Ser His 
385 390 395 " 400 

Ser He Asp Leu His He Ser Asn Thr Met He Arg Lys Gin Gin Ser 
405 410 ^ 415 

Ser Val Leu Lys He Val Leu Cys Gly Leu Asn Phe Ser Gin Glu Met 
420 425 430 

Glu Pro Lys Ser Ser Lys Arg He Ala Ser He Leu Gin Lys Asn Arg 
435 440 445 

Thr Leu Asn Leu Ser Leu He Ser He Thr Asn Leu Asp Cys Leu Lys 
450 455 460 

Arg Leu Thr Arg Gin Leu His Arg Arg Ser Leu Lys Lys Ser Asn Phe 
465 470 475 480 

Tyr Leu Glu Asn Lys Leu Asn Tyr Tyr Arg Asp Gin Arg Ser Ser He 
485 490 " 495 

Asp Glu He Lys Ala Glu Leu Lys Leu Asp Asn Lys Lys Tyr Lys Pro 
500 505 * 510 



Trp Ser Lys Tyr Cys Gly Arg Lys Arg Arg Pro Val Ser Lys Arg Lys 
515 520 ' 525 " 
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Asn Lys Ala lie Asn Lys Met Ser Thr Glu Val Lys Lys Lys lie Tyr 
530 535 540 

Phe Phe Gin Phe lie Glu Lys Arg Gly Phe Gly Val Leu Gly Phe Trp 
545 550 ^ 555 " 560 

INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Glu lie Glu Asn Asn Gin Ala Gin Gin Pro Lys Ala Glu Lys Leu 
15 10 15 

Trp Trp Glu Leu Glu Leu Glu Met Gin Glu Asn Gin Asn Asp lie Gin 
20 25 30 

Val Arg Val Lys lie Asp Asp Pro Lys Gin Tyr Leu Val Asn Val Thr 
35 40 45 

Ala Ala Cys Leu Leu Gin Glu Gly Ser Tyr Tyr Gin Asp Lys Asp Glu 
50 55 60 

Arg Arg Tyr lie lie Thr Lys Ala Leu Leu Glu Val Ala Glu Ser Asp 
65 70 75 80 

Pro Glu Phe lie Cys Gin Leu Ala Val Tyr lie Arg Asn Glu Leu Tyr 
85 90 95 

lie Arg Thr Thr Thr Asn Tyr lie Val Ala Phe Cys Val Val His Lys 
100 105 110 

Asn Thr Gin Pro Phe lie Glu Lys Tyr Phe Asn Lys Ala Val Leu Leu 
115 120 125 

Pro Asn Asp Leu Leu Glu Val Cys Glu Phe Ala Gin Val Leu Tyr lie 
130 135 140 

Phe Asp Ala Thr Glu Phe Lys Asn Leu Tyr Leu Asp Arg lie Leu Ser 
145 150 155 " 160 

Gin Asp lie Arg Lys Glu Leu Thr Phe Arg Lys Cys Leu Gin Arg Cys 
165 170 175 

Val Arg Ser Lys Phe Ser Glu Phe Asn Glu Tyr Gin Leu Gly Lys Tyr 
180 185 190 

Cys Thr Glu Ser Gin Arg Lys Lys Thr Met Phe Arg Tyr Leu Ser Val 
195 200 205 

Thr Asn Lys Gin Lys Trp Asp Gin Thr Lys Lys Lys Arg Lys Glu Asn 
210 215 220 

Leu Leu Thr Lys Leu Gin Ala lie Lys Glu Ser Glu Asp Lys Ser Lys 
225 230 235 " 240 
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Arg Glu Thr Gly Asp He Met Asn Val Glu Asp Ala He Lys Ala Leu 
245 250 255 

Lys Pro Ala Val Met Lys Lys He Ala Lys Arg Gin Asn Ala Met Lys 
260 265 " 270 

Lys His Met Lys Ala Pro Lys He Pro Asn Ser Thr Leu Glu Ser Lys 
275 280 285 

Tyr Leu Thr Phe Lys Asp Leu He Lys Phe Cys His He Ser Glu Pro 
290 295 300 

Lys Glu Arg Val Tyr Lys He Leu Gly Lys Lys Tyr Pro Lys Thr Glu 
305 310 315 * 320 

Glu Glu Tyr Lys Ala Ala Phe Gly Asp Ser Ala Ser Ala Pro Phe Asn 
325 330 335 

Pro Glu Leu Ala Gly Lys Arg Met Lys He Glu He Ser Lys Thr Trp 
340 345 350 

Glu Asn Glu Leu Ser Ala Lys Gly Asn Thr Ala Glu Val Trp Asp Asn 
355 360 365 

Leu He Ser Ser Asn Gin Leu Pro Tyr Met Ala Met Leu Arg Asn Leu 
370 375 380 

Ser Asn He Leu Lys Ala Gly Val Ser Asp Thr Thr His Ser He Val 
385 390 395 400 

He Asn Lys He Cys Glu Pro Lys Ala Val Glu Asn Ser Lys Met Phe 
405 ~ 410 415 

Pro Leu Gin Phe Phe Ser Ala He Glu Ala Val Asn Glu Ala Val Thr 
420 425 430 

Lys Gly Phe Lys Ala Lys Lys Arg Glu Asn Met Asn Leu Lys Gly Gin 
435 440 445 

He Glu Ala Val Lys Glu Val Val Glu Lys Thr Asp Glu Glu Lys Lys 
450 455 460 

Asp Met Glu Leu Glu Gin Thr Glu Glu Gly Glu Phe Val Lys Val Asn 
465 470 * 475 480 

Glu Gly He Gly Lys Gin Tyr He Asn Ser He Glu Leu Ala He Lys 
485 490 495 

He Ala Val Asn Lys Asn Leu Asp Glu He Lys <31y His Thr Ala He 
500 505 510 

Phe Ser Asp Val Ser Gly Ser Met Ser Thr Ser Met Ser Gly Gly Ala 
515 520 525 

Lys Lys Tyr Gly Ser Val Arg Thr Cys Leu Glu Cys Ala Leu Val Leu 
530 535 540 

Gly Leu Met Val Lys Gin Arg Cys Glu Lys Ser Ser Phe Tyr He Phe 
545 * 550 555 ~ 560 



Ser Ser Pro Ser Ser Gin Cys Asn Lys Cys Tyr Leu Glu Val Asp Leu 
565 ^ ~ 570 575 
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Pro Gly Asp Glu Leu Arg Pro Ser Met Gin Lys Leu Leu Gin Glu Lys 
580 585 590 

Gly Lys Leu Gly Gly Gly Thr Asp Phe Pro Tyr Glu Cys lie Asp Glu 
595 * 600 605 

Trp Thr Lys Asn Lys Thr His Val Asp Asn lie Val lie Leu Ser Asp 
610 615 620 

Met Met He Ala Glu Gly Tyr Ser Asp He Asn Val Arg Gly Ser Ser 
625 630 635 640 

He Val Asn Ser He Lys Lys Tyr Lys Asp Glu Val Asn Pro Asn He 
645 650 655 

Lys He Phe Ala Val Asp Leu Glu Gly Tyr Gly Lys Cys Leu Asn Leu 
660 665 670 

Gly Asp Glu Phe Asn Glu Asn Asn Tyr He Lys He Phe Gly Met Ser 
675 680 685 

Asp Ser He Leu Lys Phe He Ser Ala Lys Gin Gly Gly Ala Asn Met 
690 695 700 

Val Glu Val He Lys Asn Phe Ala Leu Gin Lys He Gly Gin Lys 
705 710 715 

INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 872 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ser Arg Arg Asn Gin Lys Lys Pro Gin Ala Pro He Gly Asn Glu 
1^5 10 15 

Thr Asn Leu Asp Phe Val Leu Gin Asn Leu Glu Val Tyr Lys Ser Gin 
20 25 30 

He Glu His Tyr Lys Thr Gin Gin Gin Gin He Lys Glu Glu Asp Leu 
35 40 45 

Lys Leu Leu Lys Phe Lys Asn Gin Asp Gin Asp Gly Asn Ser Gly Asn 
50 55 60 

Asp Asp Asp Asp Glu Glu Asn Asn Ser Asn Lys Gin Gin Glu Leu Leu 
65 ~ " " 70 75 80 

Arg Arg Val Asn Gin He Lys Gin Gin Val Gin Leu He Lys Lys Val 
85 * 90 95 

Gly Ser Lys Val Glu Lys Asp Leu Asn Leu Asn Glu Asp Glu Asn Lys 
100 * " 105 110 

Lys Asn Gly Leu Ser Glu Gin Gin Val Lys Glu Glu Gin Leu Arg Thr 
115 120 125 
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lie Thr Glu Glu Gin Val Lys Tyr Gin Asn Leu Val Phe Asn Met Asp 
130 135 140 

Tyr Gin Leu Asp Leu Asn Glu Ser Gly Gly His Arg Arg His Arg Arg 
145 150 155 ^ 160 

Glu Thr Asp Tyr Asp Thr Glu Lys Trp Phe Glu lie Ser His Asp Gin 
165 170 175 

Lys Asn Tyr Val Ser lie Tyr Ala Asn Gin Lys Thr Ser Tyr Cys Trp 
180 185 190 

Trp Leu Lys Asp Tyr Phe Asn Lys Asn Asn Tyr Asp His Leu Asn Val 
195 200 205 

Ser lie Asn Arg Leu Glu Thr Glu Ala Glu Phe Tyr Ala Phe Asp Asp 
210 215 220 

Phe Ser Gin Thr lie Lys Leu Thr Asn Asn Ser Tyr Gin Thr Val Asn 
225 230 235 240 

lie Asp Val Asn Phe Asp Asn Asn Leu Cys lie Leu Ala Leu Leu Arg 
245 250 255 

Phe Leu Leu Ser Leu Glu Arg Phe Asn lie Leu Asn lie Arg Ser Ser 
260 265 270 

Tyr Thr Arg Asn Gin Tyr Asn Phe Glu Lys He Gly Glu Leu Leu Glu 
275 280 285 

Thr He Phe Ala Val Val Phe Ser His Arg His Leu Gin Gly He His 
290 295 300 

Leu Gin Val Pro Cys Glu Ala Phe Gin Tyr Leu Val Asn Ser Ser Ser 
305 310 315 320 

Gin He Ser Val Lys Asp Ser Gin Leu Gin Val Tyr Ser Phe Ser Thr 
325 330 335 

Asp Leu Lys Leu Val Asp Thr Asn Lys Val Gin Asp Tyr Phe Lys Phe 
340 345 ~ 350 

Leu Gin Glu Phe Pro Arg Leu Thr His Val Ser Gin Gin Ala He Pro 
355 360 365 

Val Ser Ala Thr Asn Ala Val Glu Asn Leu Asn Val Leu Leu Lys Lys 
370 375 380 

Val Lys His Ala Asn Leu Asn Leu Val Ser He Pro Thr Gin Phe Asn 
385 390 395 400 

Phe Asp Phe Tyr Phe Val Asn Leu Gin His Leu Lys Leu Glu Phe Gly 
405 410 415 

Leu Glu Pro Asn He Leu Thr Lys Gin Lys Leu Glu Asn Leu Leu Leu 
420 425 430 

Ser He Lys Gin Ser Lys Asn Leu Lys Phe Leu Arg Leu Asn Phe Tyr 
435 440 445 



Thr Tyr Val Ala Gin Glu Thr Ser Arg Lys Gin He Leu Lys Gin Ala 
450 455 460 



17 



Thr Thr lie Lys Asn Leu Lys Asn Asn Lys Asn Gin Glu Glu Thr Pro 
465 470 475 480 

Glu Thr Lys Asp Glu Thr Pro Ser Glu Ser Thr Ser Gly Met Lys Phe 
485 490 495 

Phe Asp His Leu Ser Glu Leu Thr Glu Leu Glu Asp Phe Ser Val Asn 
500 505 510 

Leu Gin Ala Thr Gin Glu lie Tyr Asp Ser Leu His Lys Leu Leu lie 
515 520 525 

Arg Ser Thr Asn Leu Lys Lys Phe Lys Leu Ser Tyr Lys Tyr Glu Met 
530 535 540 

Glu Lys Ser Lys Met Asp Thr Phe lie Asp Leu Lys Asn lie Tyr Glu 
545 550 555 560 

Thr Leu Asn Asn Leu Lys Arg Cys Ser Val Asn lie Ser Asn Pro His 
565 570 575 

Gly Asn lie Ser Tyr Glu Leu Thr Asn Lys Asp Ser Thr Phe Tyr Lys 
580 585 590 

Phe Lys Leu Thr Leu Asn Gin Glu Leu Gin His Ala Lys Tyr Thr Phe 
595 600 605 

Lys Gin Asn Glu Phe Gin Phe Asn Asn Val Lys Ser Ala Lys lie Glu 
610 615 ~ 620 

Ser Ser Ser Leu Glu Ser Leu Glu Asp lie Asp Ser Leu Cys Lys Ser 
625 630 635 640 

lie Ala Ser Cys Lys Asn Leu Gin Asn Val Asn lie lie Ala Ser Leu 
645 650 655 

Leu Tyr Pro Asn Asn lie Gin Lys Asn Pro Phe Asn Lys Pro Asn Leu 
660 665 670 

Leu Phe Phe- Lys Gin Phe Glu Gin Leu Lys Asn Leu Glu Asn Val Ser 
675 680 685 

lie Asn Cys He Leu Asp Gin His He Leu Asn Ser He Ser Glu Phe 
690 695 700 

Leu Glu Lys Asn Lys Lys He Lys Ala Phe He Leu Lys Arg Tyr Tyr 
705 710 715 w 720 

Leu Leu Gin Tyr Tyr Leu Asp Tyr Thr Lys Leu Phe Lys Thr Leu Gin 
725 730 735 

Gin Leu Pro Glu Leu Asn Gin Val Tyr He Asn Gin Gin Leu Glu Glu 
740 745 750 

Leu Thr Val Ser Glu Val His Lys Gin Val Trp Glu Asn His Lys Gin 
755 760 765 

Lys Ala Phe Tyr Glu Pro Leu Cys Glu Phe He Lys Glu Ser Ser Gin 
770 775 780 



Thr Leu Gin Leu He Asp Phe Asp Gin Asn Thr Val Ser Asp Asp Ser 
785 790 795 800 
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He Lys Lys He Leu Glu Ser He Ser Glu Ser Lys Tyr His His Tyr 
805 810 * 815 

Leu Arg Leu Asn Pro Ser Gin Ser Ser Ser Leu He Lys Ser Glu Asn 
820 825 830 

Glu Glu He Gin Glu Leu Leu Lys Ala Cys Asp Glu Lys Gly Val Leu 
835 840 ~ ~ 845 

Val Lys Ala Tyr Tyr Lys Phe Pro Leu Cys Leu Pro Thr Gly Thr Tyr 
850 855 " 860 

Tyr Asp Tyr Asn Ser Asp Arg Trp 
865 870 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

U 

fa (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

D Asp He Asp Leu Asp Asp lie Glu Asn Leu Leu Pro Asn Thr Phe Asn 

•II Lys Tyr Ser Ser Ser Cys Ser Asp Lys Lys Gly Cys Lys Thr Leu Lys 

20 25 "* 30 

Mi 

|»S Ser Gly Ser Lys Ser Pro Ser Leu Thr He Pro Lys Leu Gin Lys Gin 

^ 35 40 45 

P Leu Glu Phe Tyr Phe Ser Asp Ala Asn Leu Tyr Asn Asp Ser Phe Leu 

50 55 60 

m Arg Lys Leu Val Leu Lys Ser Gly Glu Gin Arg Val Glu He Glu Thr 

n 65 70 75 so 

Hi I^u Leu Met 

(2) INFORMATION FOR SEQ ID NO: 10: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asn Val Lys Ser Ala Lys He Glu Ser Ser Ser Leu Glu Ser Leu Glu 
15 10 15 

Asp He Asp Ser Leu Cys Lys Ser He Ala Ser Cys Lys Asn Leu Gin 
20 25 * 30 
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Asn Val Asn lie lie Ala Ser Leu Leu Tyr Pro Asn Asn lie Gin Lys 
35 40 45 

Asn Pro Phe Asn Lys Pro Asn Leu Leu Phe Phe Lys Gin Phe Glu Gin 
50 ~ 55 60 

Leu Lys Asn Leu Glu Asn Val Ser He Asn Cys He Leu Asp Gin His 
65 70 75 80 

He Leu Asn Ser He Ser Glu Phe Leu Glu Lys Asn Lys Lys He Lys 
85 90 95 

Ala Phe He Leu 
100 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Glu Met Asp He Asp Leu Asp Asp He Glu Asn Leu Leu Pro Asn 
1 5 10 15 

Thr Phe Asn Lys Tyr Ser Ser Ser Cys Ser Asp Lys Lys Gly Cys Lys 
20 25 30 

Thr Leu Lys Ser Gly Ser Lys Ser Pro Ser Leu Thr He Pro Lys Leu 
35 40 45 

Gin Lys Gin Leu Glu Phe Tyr Phe Ser Asp Ala Asn Leu Tyr Asn Asp 
50 55 60 

Ser Phe Leu Arg Lys Leu Val Leu Lys Ser Gly Glu Gin Arg Val Glu 
65 70 75 80 

He Glu Thr Leu Leu 
85 

(2) INFORMATION FOR SEQ ID NO: 12: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

He Glu Leu Ala He Lys He Ala Val Asn Lys Asn Leu Asp Glu He 
1 5 10 15 

Lys Gly His Thr Ala He Phe Ser Asp Val Ser Gly Ser Met Ser Thr 
20 25 30 

20 



Ser Met Ser Gly Gly Ala Lys Lys Tyr Gly Ser Val Arg Thr Cys Leu 
35 40 45 

Glu Cys Ala Leu Val Leu Gly Leu Met Val Lys Gin Arg Cys Glu Lys 
50 55 €0 

Ser Ser Phe Tyr lie Phe Ser Ser Pro Ser Ser Gin Cys Lys Cys Tyr 
65 70 75 *" 80 

Leu Glu Val Asp Leu Pro Gly Asp Glu Leu Arg Pro Ser Met Gin Lys 
85 90 95 

Leu Leu 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Gin Pro Lys Leu Phe Phe Ala Thr Met Asp lie Glu Lys Cys Tyr 
15 10 15 

Asp Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys 
20 25 30 

Leu Leu Lys Phe Tyr Lys Gin Thr Lys Gly lie Pro Gin Gly Leu Cys 
35 40 45 

Val Ser Ser lie Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu Glu Ser 
50 55 60 

Ser Leu Gly Phe Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Lys Asn Arg Asn Leu His Cys Thr Tyr lie Asp Tyr Lys Lys Ala Phe 
1 5 10 15 

Asp Ser lie Pro His Ser Trp Leu lie Gin Val Leu Glu lie Tyr Lys 
20 25 30 

lie Asn Arg Gin lie Ala lie Lys Lys Gly lie Tyr Gin Gly Asp Ser 
35 40 45 
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Leu Ser Pro Leu Trp Phe Cys Leu Ala Leu Asn Pro Leu Ser His Gin 
50 55 60 

Leu His Asn Asp Arg 
65 

INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Phe Gly Gly Ser Asn Trp Phe Arg Glu Val Asp Leu Lys Lys Cys Phe 
1 5 10 15 

Asp Thr lie Ser His Asp Leu lie lie Lys Glu Leu Lys Arg Tyr lie 
20 25 " 30 

Ser Asp His Val Pro Val Gly Pro Arg Val Cys Val Gin Gly Ala Pro 
35 40 45 

Thr Ser Pro Ala Leu Cys Asn Ala Val Leu Leu Arg Leu Asp Arg Arg 
50 55 60 

Leu Ala Gly Leu Ala 
65 

INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr 
1 5 10 * 15 

Phe Ser Val Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr 
20 25 30 

lie Pro Gly lie Arg Tyr Gin Tyr Asn Val Leu Pro Gin Gly Trp Lys 
35 40 45 

Gly Ser Pro Ala He Phe Gin Ser Ser Met Thr Lys He Leu Glu Pro 
50 55 60 

Phe Arg Lys Gin Asn 
65 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Val Leu Pro Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr 
1 5 10 15 

Asp Ser He Pro Arg Met Glu Cys Met Arg He Leu Lys Asp Ala Leu 
20 25 30 

Lys Asn Lys Cys Tyr He Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser 
35 40 45 

Leu Ser Ala Pro He Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe 
50 55 60 

Tyr Ser Glu Phe Lys 

C 65 Y 

□ 

□ (2) INFORMATION FOR SEQ ID NO: 18: 

Ml 

1» (i) SEQUENCE CHARACTERISTICS: 

g (A) LENGTH: 54 amino acids 

H r (B) TYPE: amino acid 

M (C) STRANDEDNESS: not relevant 

U (D) TOPOLOGY: not relevant 



□ 



rj 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu He Thr Thr Gin Glu Asn 
15 10 15 

Asn Ala Val Leu Phe He Glu Lys Leu He Asn Val Ser Arg Glu Asn 
20 25 30 

Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr Gin Asp Tyr Cys Asp 
35 40 45 

Trp He Gly He Ser He 
50 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



23 



' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

His Leu lie Tyr Met Asp Asp lie Lys Leu Tyr Ala Lys Asn Asp Lys 
1 5 10 15 

Glu Met Lys Lys Leu He Asp Thr Thr Thr He Phe Ser Asn Asp lie 
20 25 30 

Ser Met Gin Phe Gly Leu Asp Lys Cys Lys Thr Lys Cys Leu Tyr Lys 
35 40 45 

Tyr Leu Gly Phe Gin Gin 
50 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Tyr Val Arg Tyr Ala Asp Asp He Leu He Gly Val Leu Gly Ser Lys 
1 5 10 15 

Asn Lys He He Lys Arg Asp Leu Asn Asn Phe Leu Asn Ser Leu Gly 
20 25 30 

Leu Thr He Asn Glu Glu Lys Thr Leu He Glu Thr Pro Ala Arg Phe 
35 40 45 

Leu Gly Tyr Asn He 
50 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

He Tyr Gin Tyr Met Asp Asp Leu Tyr Val Gly Ser His Leu Glu He 
1 5 10 15 

Gly His Arg Thr Lys He Glu Glu Leu Arg Gin His Leu Leu Arg Trp 
20 25 30 

Gly Leu Thr Thr Pro Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu 
35 40 45 

Trp Met Gly Tyr Glu Leu 
50 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

lie Leu Lys Leu Ala Asp Asp Phe Leu He He Ser Thr Asp Gin Gin 
1 S 10 15 

Gin Val He Asn He Lys Lys Leu Ala Met Gly Gly Phe Gin Lys Tyr 
20 25 " 30 

Asn Ala Lys Ala Asn Arg He Arg Ser Lys Ser Ser Lys Gly He Phe 
35 40 45 

Arg 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



K (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

P Leu Gin Lys Gin Leu Glu Phe Tyr Phe Ser Asp Ala Asn Leu Tyr Asn 

U 1 5 10 15 



Asp Ser Phe Leu Arg Lys Leu Val Leu Lys Ser Gly Glu Gin Arg Val 
20 25 30 

Glu He Glu Thr Leu Leu Met 



P 

\\» 35 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
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- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

lie Cys His Gin Glu Tyr Tyr Phe Gly Asp Phe Asn Leu Pro Arg Asp 
15 10 15 

Lys Phe Leu Lys <31u Gin lie Lys Leu Asp Glu Gly Trp Val Pro Leu 
20 25 30 

Glu He Met He Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

He Cys Glu Gin He Glu Tyr Tyr Phe Gly Asp His Asn Leu Pro Arg 
N 15 10 15 



B 

in 



m 



Asp Lys Phe Leu Lys Gin Gin He Leu Leu Asp Asp Gly Trp Val Pro 
20 25 30 



*|b Leu Glu Thr Met He Lys 



35 

(2) INFORMATION FOR SEQ ID NO: 26: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

He Leu Arg Gin Val Glu Tyr Tyr Phe Gly Asp Ala Asn Leu Asn Arg 
1 5 10 15 

Asp Lys Phe Leu Arg Glu Gin He Gly Lys Asn Glu Asp Gly Trp Val 
20 25 30 

Pro Leu Ser Val Leu Val Thr 
35 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

26 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Cys Leu Lys Gin Val Glu Phe Tyr Phe Ser Glu Phe Asn Phe Pro Tyr 
1 5 10 15 

Asp Arg Phe Leu Arg Thr Thr Ala Glu Lys Asn Asp Gly Trp Val Pro 
20 25 * * 30 

lie Ser Thr lie Ala Thr 
35 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

TAGACCTGTT AGTGTACATT TGAATTGAAG C 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

TAGACCTGTT AGGTTGGATT TGTGGCATCA 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc * "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CAAAACCCCA AAACCTAACA GGTCTA 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GCGGGAATTC TAATACGACT CACTATAGGG AAGAAACTCT GATGAGGCCG AAAGGCCGAA 60 

ACTCCACGAA AGTGGAGTAA GTTTCTCGAT AATTGATCTG TAG 103 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc « M DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

jff CGGGGATCCT CTTCAAAAGA TGAGAGGACA GCAAAC 36 

f! (2) INFORMATION FOR SEQ ID NO: 33: 

* (i) SEQUENCE CHARACTERISTICS: 

EJ1 (A) LENGTH: 60 base pairs 

U (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
C" (D) TOPOLOGY: linear 

m 

Q (ii) MOLECULE TYPE: other nucleic acid 

fjj (A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CCCCAAAACC CCAAAACCCC AAAACCCCCA CAGGGGTTTT GGGGTTTTGG GGTTTTGGGG 60 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CCAAAACCCC AAAACCCCAA AACCCCCACA GGGGTTTTGG GGTTTTGGGG TTTTGGGG 58 
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(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

AAAACCCCAA AACCCCAAAA CCCCCACAGG GGTTTTGGGG TTTTGGGGTT TTGGGG 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

AACCCCAAAA CCCCAAAACC CCCACAGGGG TTTTGGGGTT TTGGGGTTTT GGGG 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CCCCAAAACC CCAAAACCCC CACAGGGGTT TTGGGGTTTT GGGGTTTT 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ~ 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

AAAACCCCAA AACCCCAAAA CCCCCACAGG GGTTTTGGGG TTTTGGGGTT TT 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear * 

<ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AACCCCAAAA CCCCAAAACC CCCACAGGGG TTTTGGGGTT TTGGGGTTTT 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CCCCAAAACC CCAAAACCCC CACAGGGGTT TTGGGGTTTT GGGGTTTT 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CCAAAACCCC AAAACCCCCA CAGGGGTTTT GGGGTTTTGG GGTTTT 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear * 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

AAAACCCCAA AACCCCCACA GGGGTTTTGG GGTTTTGGGG TTTT 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "RNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

CAAAACCCCA AAACC 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA M 

(Xi) .SEQUENCE DESCRIPTION: SEQ ID NO: 44 

TTTTGGGG 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "RNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 

CAAAACCCCA" AAACC 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 



GGGGTTTT 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

TCTRAARTAR TGDGTNADRT TRTTCAT 27 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

GCGGATCCAT GAAYCCWGAR AAYCCWAAYG T 31 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

NNNGTNACHG GHATHAAYAA 20 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear * 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

DGCDGTYTCY TGRTCRTTRT A 21 
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(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 








AACTCATTTA 


ATTACTAATT 


TAATCAACAA 


GATTGATAAA 


AAGCAGTAAA 


TAAAACCCAA 


fin 


TAGATTTAAT 


TTAGAAAGTA 


TCAATTGAAA 


AATGGAAATT 


GAAAAPAAPT 


AAGPAPAATA 




GCCAAAAGCC 


GAAAAATTGT 


GGTGGGAACT 


TGAATTAGAG 


ATGPAAGAAA 


APPAA A ATYia 


1 QA 


TATATAAGTT 


AGGGTTAAGA 


TTGACGATCC 


TAAGCAATAT 


CTCGTGAACG 


TPAPTGPACJP 


O AA 


ATGTTTGTTG 


TAGGAAGGTA 


GTTACTACTA 


AGATAAAGAT 


GAAAGAAGAT 


ATATPATPAP 


onn 
ouU 


TAAAGCACTT 


CTTGAGGTGG 


CTGAGTCTGA 


TCCTGAGTTC 


ATPTGPTAGT 


TGGPAGTPTA 


jOU 


CATCCGTAAT 


GAACTTTACA 


TCAGAACTAC 


CACTAACTAC 


ATTGTAGPAT 


X X XvjXvjX X vj X 




CCACAAGAAT 


ACTCAACCAT 


TCATCGAAAA 


GTACTTCAAC 


AAAGPAGTAP 


1 x 1 1 IjV_L» LAA 


480 


TGACTTACTG 


GAAGTCTGTG 


AATTTGCATA 


GGTTCTCTAT 


■t\X XXX X wAl w 


P A A f'TVtSTi & TT 
V»tu1V»XV3AjHlX X 


C A A 


CAAAAATTTG 


TATCTTGATA 


GGATACTTTC 


ATAAGATATT 


CGTAAGGAAP 


TP A PTTTP r*f2 
X LttL XXX LLu 


£ A A 


TAAGTGTTTA 


CAAAGATGCG 


TCAGAAGCAA 


GTTTTCTGAA 


TTCAACGAAT 


APTA APTTGG 


o o u 


TAAGTATTGC 


ACTGAATCCT 


AACGTAAGAA 


AACAATGTTC 


CGTTACCTCT 


CAGTTAPPAA 




CAAGTAAAAG 


TGGGATTAAA 


CTAAGAAGAA 


GAGAAAAGAG 


AATCTCTTAA 


HPAA APTTTA 


TO A 

# O U 


GGCAATAAAG 


GAATCTGAAG 


ATAAGTCCAA 


GAGAGAAACT 


GGAGACATAA 


TGAAPGTTGA 


Q AO 


AGATGCAATC 


AAGGCTTTAA 


AACCAGCAGT 


TATGAAGAAA 


ATAGCCAAGA 


GATAGAATGP 




CATGAAGAAA 


CACATGAAGG 


CACCTAAAAT 


TCCTAACTCT 


ACCTTGGAAT 


CAAAGTACTT 


960 


GACCTTCAAG 


GATCTCATTA 


AGTTCTGCCA 


TATTTCTGAG 


CCTAAAGAAA 


GAGTCTATAA 


1020 


GATCCTTGGT 


AAAAAATACC 


CTAAGACCGA 


AGAGGAATAC 


AAAGCAGCCT 


TTGGTGATTC 


1080 


TGCATCTGCA 


CCCTTCAATC 


CTGAATTGGC 


TGGAAAGCGT 


ATGAAGATTG 


AAATCTCTAA 


1140 


AACATGGGAA 


AATGAACTCA 


GTGCAAAAGG 


CAACACTGCT 


GAGGTTTGGG 


ATAATTTAAT 


1200 


TTCAAGCAAT 


TAACTCCCAT 


ATATGGCCAT 


GTTACGTAAC 


TTGTCTAACA 


TCTTAAAAGC 


1260 


CGGTGTTTCA 


GATACTACAC 


ACTCTATTGT 


GATCAACAAG 


ATTTGTGAGC 


CCAAGGCCGT 


1320 


TGAGAACTCC 


AAGATGTTCC 


CTCTTCAATT 


CTTTAGTGCC 


ATTGAAGCTG 


TTAATGAAGC 


1380 


AGTTACTAAG 


GGATTCAAGG 


CCAAGAAGAG 


AGAAAATATG 


AATCTTAAAG 


GTCAAATCGA 


1440 


AGCAGTAAAG 


GAAGTTGTTG 


AAAAAACCGA 


TGAAGAGAAG 


AAAGATATGG 


AGTTGGAGTA 


1500 
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AACCGAAGAA GGAGAATTTG 
CATTGAACTT GCAATCAAGA 
TGCAATCTTC TCTGATGTTT 
GTATGGTTCC GTTCGTACTT 
ACGTTGTGAA AAGTCCTCAT 
TTACTTAGAA GTTGATCTCC 
AGAGAAAGGA AAACTTGGTG 
AAAGAATAAA ACTCACGTAG 
ATATTCAGAT ATCAATGTTA 
TGAAGTAAAT CCTAACATTA 
TAATCTAGGT GATGAGTTCA 
AATCTTAAAG TTCATTTCAG 
CTTTGCCCTT CAAAAAATAG 
CTCACCCCAC TTTTTTGTTT 
ATTTAAGTTA CTTACATAGT 
AAAGAACAAA AAAGATTAAA 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Glu Leu Glu Leu Glu Met Gin Glu Asn Gin Asn Asp He Gin Val Arg 
1 5 10 15 

Val Lys He Asp Asp Pro Lys Gin Tyr Leu Val Asn Val Thr Ala Ala 
20 25 30 

Cys Leu Leu Gin Glu Gly Ser Tyr Tyr Gin Asp Lys Asp Glu Arg Arg 
35 40 45 

Tyr He He Thr Lys Ala Leu Leu Glu Val Ala Glu Ser Asp Pro Glu 
50 55 60 

Phe He Cys Gin Leu Ala Val Tyr He Arg Asn Glu Leu Tyr He Arg 
65 70 75 80 

Thr Thr Thr Asn Tyr He Val Ala Phe Cys Val Val His Lys Asn Thr 
85 90 95 
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TTAAAGTCAA CGAAGGAATT GGCAAGCAAT ACATTAACTC 1560 

TAGCAGTTAA CAAGAATTTA GATGAAATCA AAGGACACAC 1620 

CTGGTTCTAT GAGTACCTCA ATGTCAGGTG GAGCCAAGAA 1680 

GTCTCGAGTG TGCATTAGTC CTTGGTTTGA TGGTAAAATA 1740 

TCTACATCTT CAGTTCACCT AGTTCTCAAT GCAATAAGTG 1800 

CTGGAGACGA ACTCCGTCCT TCTATGTAAA AACTTTTGCA 1860 

GTGGTACTGA TTTCCCCTAT GAGTGCATTG ATGAATGGAC 1920 

ACAATATCGT TATTTTGTCT GATATGATGA TTGCAGAAGG 1980 

GAGGCAGTTC CATTGTTAAC AGCATCAAAA AGTACAAGGA 2040 

AAATCTTTGC AGTTGACTTA GAAGGTTACG GAAAGTGCCT 2100 

ATGAAAACAA CTACATCAAG ATATTCGGTA TGAGCGATTC 2160 

CCAAGCAAGG AGGAGCAAAT ATGGTCGAAG TTATCAAAAA 2220 

GACAAAAGTG AGTTTCTTGA GATTCTTCTA TAACAAAAAT 2280 

TATTGCATAG CCATTATGAA ATTTAAATTA TTATCTATTT 2340 

TTATGTATCG CAGTCTATTA GCCTATTCAA ATGATTCTGC 2400 

A 2421 



Gin Pro Phe He Glu Lys Tyr Phe Asn Lys Ala Val Leu Leu Pro Asn 
100 105 no 

Asp Leu Leu Glu Val Cys Glu Phe Ala Gin Val Leu Tyr He Phe Asp 
115 120 125 

Ala Thr Glu Phe Lys Asn Leu Tyr Leu Asp Arg He Leu Ser Gin Asp 
130 135 140 

He Arg Lys Glu Leu Thr Phe Arg Lys Cys Leu Gin Arg Cys Val Arg 
14 5 150 155 160 

Ser Lys Phe Ser Glu Phe Asn Glu Tyr Gin Leu Gly Lys Tyr Cys Thr 
165 170 * 175 

Glu Ser Gin Arg Lys Lys Thr Met Phe Arg Tyr Leu Ser Val Thr Asn 
180 185 % 190 

Lys Gin Lys Trp Asp Gin Thr Lys Lys Lys Arg Lys Glu Asn Leu Leu 
195 200 205 

Thr Lys Leu Gin Ala He Lys Glu Ser Glu Asp Lys Ser Lys Arg Glu 
210 215 220 

Thr Gly Asp He Met Asn Val Glu Asp Ala He Lys Ala Leu Lys Pro 
225 230 235 240 

Ala Val Met Lys Lys He Ala Lys Arg Gin Asn Ala Met Lys Lys His 
245 250 255 

Met Lys Ala Pro Lys He Pro Asn Ser Thr Leu Glu Ser Lys Tyr Leu 
260 265 270 

Thr Phe Lys Asp Leu He Lys Phe Cys His He Ser Glu Pro Lys Glu 
275 280 285 

Arg Val Tyr Lys He Leu Gly Lys Lys Tyr Pro Lys Thr Glu Glu Glu 
290 295 300 

Tyr Lys Ala Ala Phe Gly Asp Ser Ala Ser Ala Pro Phe Asn Pro Glu 
305 310 315 320 

Leu Ala Gly Lys Arg Met Lys He Glu He Ser Lys Thr Trp Glu Asn 
325 330 ~ 335 

Glu Leu Ser Ala Lys Gly Asn Thr Ala Glu Val Trp Asp Asn Leu He 

340 345 350 

Ser Ser Asn Gin Leu Pro Tyr Met Ala Met Leu Arg Asn Leu Ser Asn 
355 360 365 

He Leu Lys Ala Gly Val Ser Asp Thr Thr His Ser He Val He Asn 
370 375 380 

Lys He Cys Glu Pro Lys Ala Val Glu Asn Ser Lys Met Phe Pro Leu 
385 390 395 400 
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Gin Phe Phe Ser Ala He Glu Ala Val Asn Glu Ala Val Thr Lys Gly 
405 410 415 

Phe Lys Ala Lys Lys Arg Glu Asn Met Asn Leu Lys Gly Gin He Glu 
420 ~ 425 430 

Ala Val Lys Glu Val Val Glu Lys Thr Asp Glu Glu Lys Lys Asp Met 
435 440 445 

Glu Leu Glu Gin Thr Glu Glu Gly Glu Phe Val Lys Val Asn Glu Gly 
450 455 460 

He Gly Lys Gin Tyr He Asn Ser He Glu Leu Ala He Lys He Ala 
465 470 475 480 

Val Asn Lys Asn Leu Asp Glu He Lys Gly His Thr Ala He Phe Ser 
485 490 405 

Asp Val Ser Gly Ser Met Ser Thr Ser Met Ser Gly Gly Ala Lys Lys 
500 505 * 510 

Tyr Gly Ser Val Arg Thr Cys Leu Glu Cys Ala Leu Val Leu Gly Leu 
515 520 525 

Met Val Lys Gin Arg Cys Glu Lys Ser Ser Phe Tyr He Phe Ser Ser 
530 535 540 

Pro Ser Ser Gin Cys Asn Lys Cys Tyr Leu Glu Val Asp Leu Pro Gly 
545 550 555 560 

Asp Glu Leu Arg Pro Ser Met Gin Lys Leu Leu Gin Glu Lys Gly Lys 
565 570 575 

Leu Gly Gly Gly Thr Asp Phe Pro Tyr Glu Cys He Asp Glu Trp Thr 
580 585 590 

Lys Asn Lys Thr His Val Asp Asn He Val He Leu Ser Asp Met Met 
595 600 605 

He Ala Glu Gly Tyr Ser Asp He Asn Val Arg Gly Ser Ser He Val 
610 615 620 

Asn Ser He Lys Lys Tyr Lys Asp Glu Val Asn Pro Asn He Lys He 
625 630 635 640 

Phe Ala Val Asp Leu Glu Gly Tyr Gly Lys Cys Leu Asn Leu Gly Asp 
645 650 655 

Glu Phe Asn Glu Asn Asn Tyr He Lys He Phe Gly Met Ser Asp Ser 
660 665 670 

He Leu Lys Phe He Ser Ala Lys Gin Gly Gly Ala Asn Met Val Glu 
675 680 685 

Val He Lys Asn Phe Ala Leu Gin Lys He Gly 
690 695 



INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2829 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

TCAATACTAT TAATTAATAA ATAAAAAAAA GCAAACTACA AAGAAAATGT CAAGGCGTAA 60 

CTAAAAAAAG CCATAGGCTC CTATAGGCAA TGAAACAAAT CTTGATTTTG TATTACAAAA 120 

TCTAGAAGTT TACAAAAGCC AGATTGAGCA TTATAAGACC TAGTAGTAAT AGATCAAAGA 180 

GGAGGATCTC AAGCTTTTAA AGTTCAAAAA TTAAGATTAG GATGGAAACT CTGGCAACGA 240 

TGATGATGAT GAAGAAAACA ACTCAAATAA ATAATAAGAA TTATTAAGGA GAGTCAATTA 300 

GATTAAGTAG CAAGTTTAAT TGATAAAAAA AGTTGGTTCT AAGGTAGAGA AAGATTTGAA 360 

TTTGAACGAA GATGAAAACA AAAAGAATGG ACTTTCTGAA TAGCAAGTGA AAGAAGAGTA 420 

ATTAAGAACG ATTACTGAAG AATAGGTTAA GTATTAAAAT TTAGTATTTA ACATGGACTA 480 

CCAGTTAGAT TTAAATGAGA GTGGTGGCCA TAGAAGACAC AGAAGAGAAA CAGATTATGA 540 

□ 

[p| TACTGAAAAA TGGTTTGAAA TATCTCATGA CCAAAAAAAT TATGTATCAA TTTACGCCAA 600 

CTAAAAGACA TCATATTGTT GGTGGCTTAA AGATTATTTT AATAAAAACA ATTATGATCA 660 

f*ft TCTTAATGTA AGCATTAACA GACTAGAAAC TGAAGCCGAA TTCTATGCCT TTGATGATTT 720 

TTCACAAACA ATCAAACTTA CTAATAATTC TTACTAGACT GTTAACATAG ACGTTAATTT 780 

TGATAATAAT CTCTGTATAC TCGCATTGCT TAGATTTTTA TTATCACTAG AAAGATTCAA 840 

TATTTTGAAT ATAAGATCTT CTTATACAAG AAATTAATAT AATTTTGAGA AAATTGGTGA 900 

m 

GCTACTTGAA ACTATCTTCG CAGTTGTCTT TTCTCATCGC CACTTACAAG GCATTCATTT 960 

fjj ACAAGTTCCT TGCGAAGCGT TCTAATATTT AGTTAACTCC TCATCATAAA TTAGCGTTAA 1020 

AGATAGCTAA TTATAGGTAT ACTCTTTCTC TACAGACTTA AAATTAGTTG ACACTAACAA 1080 

AGTCCAAGAT TATTTTAAGT TCTTATAAGA ATTCCCTCGT TTGACTCATG TAAGCTAGTA 1140 

GGCTATCCCA GTTAGTGCTA CTAACGCTGT AGAGAACCTC AATGTTTTAC TTAAAAAGGT 1200 

CAAGCATGCT AATCTTAATT TAGTTTCTAT CCCTACCTAA TTCAATTTTG ATTTCTACTT 1260 

TGTTAATTTA TAACATTTGA AATTAGAGTT TGGATTAGAA CCAAATATTT TGACAAAACA 1320 

AAAGCTTGAA AATCTACTTT TGAGTATAAA ATAATCAAAA AATCTTAAAT TTTTAAGATT 1380 

AAACTTTTAC ACCTACGTTG CTTAAGAAAC CTCCAGAAAA CAGATATTAA AACAAGCTAC 1440 

AACAATCAAA AATCTCAAAA ACAATAAAAA TCAAGAAGAA ACTCCTGAAA CTAAAGATGA 1500 

AACTCCAAGC GAAAGCACAA GTGGTATGAA ATTTTTTGAT CATCTTTCTG AATTAACCGA 1560 

GCTTGAAGAT TTCAGCGTTA ACTTGTAAGC TACCCAAGAA ATTTATGATA GCTTGCACAA 1620 
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ACTTTTGATT 


AGATCAACAA 


ATTTAAAGAA GTTCAAATTA AGTTACAAAT ATGAAATGGA 


1680 


AAAGAGTAAA 


ATGGATACAT 


TCATAGATCT TAAGAATATT TATGAAACCT TAAACAATCT 


1740 


TAAAAGATGC 


TCTGTTAATA 


TATCAAATCC TCATGGAAAC ATTTCTTATG AACTGACAAA 


1800 


TAAAGATTCT 


ACTTTTTATA 


AATTTAAGCT GACCTTAAAC TAAGAATTAT AACACGCTAA 


1860 


GTATACTTTT 


AAGTAGAACG 


AATTTTAATT TAATAACGTT AAAAGTGCAA AAATTGAATC 


1920 


TTCCTCATTA 


GAAAGCTTAG 


AAGATATTGA TAGTCTTTGC AAATCTATTG CTTCTTGTAA 


1980 


AAATTTACAA 


AATGTTAATA 


TTATCGCCAG TTTGCTCTAT CCCAACAATA TTTAGAAAAA 


2040 


TCCTTTCAAT 


AAGCCCAATC 


TTCTATTTTT CAAGCAATTT GAATAATTGA AAAATTTGGA 


2100 


AAATGTATCT 


ATCAACTGTA 


TTCTTGATCA GCATATACTT AATTCTATTT CAGAATTCTT 


2160 


AGAAAAGAAT 


AAAAAAATAA 


AAGCATTCAT TTTGAAAAGA TATTATTTAT TACAATATTA 


2220 


TCTTGATTAT 


ACTAAATTAT 


TTAAAACACT TCAATAGTTA CCTGAATTAA ATTAAGTTTA 


2280 


CATTAATTAG 


CAATTAGAAG 


AATTGACTGT GAGTGAAGTA CATAAGTAAG TATGGGAAAA 


2340 


CCACAAGCAA 


AAAGCTTTCT 


ATGAACCATT ATGTGAGTTT ATCAAAGAAT CATCCTAAAC 


2400 


CCTTTAGCTA 


ATAGATTTTG 


ACCAAAACAC TGTAAGTGAT GACTCTATTA AAAAGATTTT 


2460 


AGAATCTATA 


TCTGAGTCTA 


AGTATCATCA TTATTTGAGA TTGAACCCTA GTTAATCTAG 


2520 


CAGTTTAATT 


AAATCTGAAA 


ACGAAGAAAT TTAAGAACTT CTCAAAGCTT GCGACGAAAA 


2580 


AGGTGTTTTA 


GTAAAAGCAT 


ACTATAAATT CCCTCTATGT TTACCAACTG GTACTTATTA 


2640 


CGATTACAAT 


TCAGATAGAT 


GGTGATTAAT TAAATATTAG TTTAAATAAA TATTAAATAT 


2700 


TGAATATTTC 


TTTGCTTATT 


ATTTGAATAA TACATACAAT AGTCATTTTT AGTGTTTTGA 


2760 


ATATATTTTA 


GTTATTTAAT 


TCATTATTTT AAGTAAATAA TTATTTTTCA ATCATTTTTT 


2820 


AAAAAATCG 






2829 



(2) INFORMATION FOR SEQ ID NO: 54: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 872 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Ser Arg Arg Asn Gin Lys Lys Pro Gin Ala Pro He Gly Asn Glu 
15 10 15 

Thr Asn Leu Asp Phe Val Leu Gin Asn Leu Glu Val Tyr Lys Ser Gin 
20 25 30 

He Glu His Tyr Lys Thr Gin Gin Gin Gin He Lys Glu Glu Asp Leu 
35 40 45 

38 



Lys Leu Leu Lys Phe Lys Asn Gin Asp Gin Asp Gly Asn Ser Gly Asn 
50 55 60 

Asp Asp Asp Asp Glu Glu Asn Asn Ser Asn Lys Gin Gin Glu Leu Leu 
65 70 75 80 

Arg Arg Val Asn Gin He Lys Gin Gin Val Gin Leu He Lys Lys Val 
85 90 95 

Gly Ser Lys Val Glu Lys Asp Leu Asn Leu Asn Glu Asp Glu Asn Lys 
100 105 110 

Lys Asn Gly Leu Ser Glu Gin Gin Val Lys Glu Glu Gin Leu Arg Thr 
115 120 * 125 

He Thr Glu Glu Gin Val Lys Tyr Gin Asn Leu Val Phe Asn Met Asp 
130 135 140 

Tyr Gin Leu Asp Leu Asn Glu Ser Gly Gly His Arg Arg His Arg Arg 
145 150 155 ~ " 160 

Glu Thr Asp Tyr Asp Thr Glu Lys Trp Phe Glu He Ser His Asp Gin 
165 170 175 

Lys Asn Tyr Val Ser He Tyr Ala Asn Gin Lys Thr Ser Tyr Cys Trp 
180 185 190 

Trp Leu Lys Asp Tyr Phe Asn Lys Asn Asn Tyr Asp His Leu Asn Val 
195 200 205 

Ser He Asn Arg Leu Glu Thr Glu Ala Glu Phe Tyr Ala Phe Asp Asp 
210 215 220 

Phe Ser Gin Thr He Lys Leu Thr Asn Asn Ser Tyr Gin Thr Val Asn 
225 230 235 * 240 

He Asp Val Asn Phe Asp Asn Asn Leu Cys He Leu Ala Leu Leu Arg 
245 250 255 

Phe Leu Leu Ser Leu Glu Arg Phe Asn He Leu Asn He Arg Ser Ser 
260 265 270 

Tyr Thr Arg Asn Gin Tyr Asn Phe Glu Lys He Gly Glu Leu Leu Glu 
275 280 285 

Thr He Phe Ala Val Val Phe Ser His Arg His Leu Gin Gly He His 
290 295 300 

Leu Gin Val Pro Cys Glu Ala Phe Gin Tyr Leu Val Asn Ser Ser Ser 
305 310 315 320 

Gin He Ser Val Lys Asp Ser Gin Leu Gin Val Tyr Ser Phe Ser Thr 
325 330 335 

Asp Leu Lys Leu Val Asp Thr Asn Lys Val Gin Asp Tyr Phe Lys Phe 
340 345 350 

Leu Gin Glu Phe Pro Arg Leu Thr His Val Ser Gin Gin Ala He Pro 
355 360 365 



Val Ser Ala Thr Asn Ala Val Glu Asn Leu Asn Val Leu Leu Lys Lys 
370 375 380 
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Val Lys His Ala Asn Leu Asn Leu Val Ser lie Pro Thr Gin Phe Asn 
385 390 395 400 

Phe Asp Phe Tyr Phe Val Asn Leu Gin His Leu Lys Leu Glu Phe Gly 
405 410 415 

Leu Glu Pro Asn lie Leu Thr Lys Gin Lys Leu Glu Asn Leu Leu Leu 
420 425 ^ 430 

Ser He Lys Gin Ser Lys Asn Leu Lys Phe Leu Arg Leu Asn Phe Tyr 
435 440 445 

Thr Tyr Val Ala Gin Glu Thr Ser Arg Lys Gin He Leu Lys Gin Ala 
450 455 460 

Thr Thr He Lys Asn Leu Lys Asn Asn Lys Asn Gin Glu Glu Thr Pro 
465 470 475 480 

Glu Thr Lys Asp Glu Thr Pro Ser Glu Ser Thr Ser Gly Met Lys Phe 
485 490 495 

Phe Asp His Leu Ser Glu Leu Thr Glu Leu Glu Asp Phe Ser Val Asn 
500 505 ~ 510 

Leu Gin Ala Thr Gin Glu He Tyr Asp Ser Leu His Lys Leu Leu He 
515 520 525 

Arg Ser Thr Asn Leu Lys Lys Phe Lys Leu Ser Tyr Lys Tyr Glu Met 
530 535 540 

Glu Lys Ser Lys Met Asp Thr Phe He Asp Leu Lys Asn He Tyr Glu 
545 550 555 560 

Thr Leu Asn Asn Leu Lys Arg Cys Ser Val Asn He Ser Asn Pro His 
565 570 575 

Gly Asn He Ser Tyr Glu Leu Thr Asn Lys Asp Ser Thr Phe Tyr Lys 
580 585 590 

Phe Lys Leu Thr Leu Asn Gin Glu Leu Gin His Ala Lys Tyr Thr Phe 
595 600 605 

Lys Gin Asn Glu Phe Gin Phe Asn Asn Val Lys Ser Ala Lys He Glu 
610 615 620 

Ser Ser Ser Leu Glu Ser Leu Glu Asp He Asp Ser Leu Cys Lys Ser 
625 630 635 640 

He Ala Ser Cys Lys Asn Leu Gin Asn Val Asn He He Ala Ser Leu 
645 650 655 

Leu Tyr Pro Asn Asn He Gin Lys Asn Pro Phe Asn Lys Pro Asn Leu 
660 665 670 

Leu Phe Phe Lys Gin Phe Glu Gin Leu Lys Asn Leu Glu Asn Val Ser 
675 680 685 

He Asn Cys He Leu Asp Gin His He Leu Asn Ser He Ser Glu Phe 
690 695 700 

Leu Glu Lys Asn Lys Lys He Lys Ala Phe He Leu Lys Arg Tyr Tyr 
705 710 715 ' 720 
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Leu Leu Gin Tyr Tyr Leu Asp Tyr Thr Lys Leu Phe Lys Thr Leu Gin 
725 730 735 

Gin Leu Pro Glu Leu Asn Gin Val Tyr He Asn Gin Gin Leu Glu Glu 
740 745 750 

Leu Thr Val Ser Glu Val His Lys Gin Val Trp Glu Asn His Lys Gin 
755 760 765 

Lys Ala Phe Tyr Glu Pro Leu Cys Glu Phe He Lys Glu Ser Ser Gin 
770 775 780 

Thr Leu Gin Leu He Asp Phe Asp Gin Asn Thr Val Ser Asp Asp Ser 
785 790 795 800 

lie Lys Lys He Leu Glu Ser He Ser Glu Ser Lys Tyr His His Tyr 
805 810 815 

Leu Arg Leu Asn Pro Ser Gin Ser Ser Ser Leu He Lys Ser Glu Asn 
820 825 830 

Glu Glu He Gin Glu Leu Leu Lys /Ala Cys Asp Glu Lys Gly Val Leu 
835 840/ 845 

Val Lys Ala Tyr Tyr Lys Phe Pro Leu Cys Leu Pro Thr Gly Thr Tyr 
850 855 860 

Tyr Asp Tyr Asn Ser Asp Arg Trp 
865 870 

INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Lys He Leu Phe Glu Phe He Gin Asp Lys Leu Asp He Asp Leu 
15 10 15 

Gin Thr Asn Ser Thr Tyr Lys Glu Asn Leu Lys Cys Gly His Phe Asn 
20 25 30 

Gly Leu Asp Glu He Leu Thr Thr Cys Phe Ala Leu Pro Asn Ser Arg 
35 40 45 

Lys He Ala Leu Pro Cys Leu Pro Gly Asp Leu Ser His Lys Ala Val 
50 55 60 

He Asp His Cys He He Tyr Leu Leu Thr Gly Glu Leu Tyr Asn Asn 
65 70 75 80 

Val Leu Thr Phe Gly Tyr Lys He Ala Arg Asn Glu Asp Val Asn Asn 
85 90 95 

Ser Leu Phe Cys His Ser Ala Asn Val Asn Val Thr Leu Leu Lys Gly 
100 105 HO 



41 



Ala Ala Trp Lys Met Phe His Ser Leu Val Gly Thr Tyr Ala Phe Val 
115 120 4 125 

Asp Leu Leu lie Asn Tyr Thr Val He Gin Phe Asn Gly Gin Phe Phe 
130 135 140 

Thr Gin He Val Gly Asn Arg Cys Asn Glu Pro His Leu Pro Pro Lys 
"5 150 155 160 

Trp Val Gin Arg Ser Ser Ser Ser Ser Ala Thr Ala Ala Gin He Lys 
165 170 175 

Gin Leu Thr Glu Pro Val Thr Asn Lys Gin Phe Leu His Lys Leu Asn 
180 185 190 

He Asn Ser Ser Ser Phe Phe Pro Tyr Ser Lys He Leu Pro Ser Ser 
195 200 205 

Ser Ser He Lys Lys Leu Thr Asp Leu Arg Glu Ala He Phe Pro Thr 
210 215 220 

Asn Leu Val Lys He Pro Gin Arg Leu Lys Val Arg He Asn Leu Thr 
225 230 235 240 

Leu Gin Lys Leu Leu Lys Arg His Lys Arg Leu Asn Tyr Val Ser He 
245 250 * 255 

Leu Asn Ser He Cys Pro Pro Leu Glu Gly Thr Val Leu Asp Leu Ser 
260 265 270 

His Leu Ser Arg Gin Ser Pro Lys Glu Arg Val Leu Lys Phe He He 
275 280 285 

Val He Leu Gin Lys Leu Leu Pro Gin Glu Met Phe Gly Ser Lys Lys 
290 295 300 

Asn Lys Gly Lys He He Lys Asn Leu Asn Leu Leu Leu Se? Leu Pro 
305 310 315 320 

Leu Asn Gly Tyr Leu Pro Phe Asp Ser Leu Leu Lys Lys Leu Arg Leu 
325 330 " 335 

Lys Asp Phe Arg Trp Leu Phe He Ser Asp He Trp Phe Thr Lys His 
340 345 350 

Asn Phe Glu Asn Leu Asn Gin Leu Ala He Cys Phe He Ser Trp Leu 
355 360 " 365 

Phe Arg Gin Leu He Pro Lys He He Gin Thr Phe Phe Tyr Cys Thr 
370 375 380 

Glu He Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp Thr Trp 
385 390 395 400 

Asn Lys Leu He Thr Pro Phe He Val Glu Tyr Phe Lys Thr Tyr Leu 
'405 410 415 

Val Glu Asn Asn Val Cys Arg Asn His Asn Ser Tyr Thr Leu Ser Asn 
420 425 430 

Phe Asn His Ser Lys Met Arg He He Pro Lys Lys Ser Asn Asn Glu 
435 440 445 
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Phe Arg lie He Ala He Pro Cys Arg Gly Ala Asp Glu Glu Glu Phe 
450 455 460 

Thr He Tyr Lys Glu Asn His Lys Asn Ala He Gin Pro Thr Gin Lys 
4g 5 470 475 480 

He Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr Ser Phe Thr Lys He 
485 490 495 

Tyr Ser Pro Thr Gin He Ala Asp Arg He Lys Glu Phe Lys Gin Arg 
500 505 510 

Leu Leu Lys Lys Phe Asn Asn Val Leu Pro Glu Leu Tyr Phe Met Lys 
515 520 525 

Phe Asp Val Lys Ser Cys Tyr Asp Ser He Pro Arg Met Glu Cys Met 
530 535 540 

Arg He Leu Lys Asp Ala Leu Lys Asn Glu Asn Gly Phe Phe Val Arg 
545 550 555 560 

Ser Gin Tyr Phe Phe Asn Thr Asn Thr Gly Val Leu Lys Leu Phe Asn 
565 570 575 

Val Val Asn Ala Ser Arg Val Pro Lys Pro Tyr Glu Leu Tyr He Asp 
580 585 * 590 

Asn Val Arg Thr Val His Leu Ser Asn Gin Asp Val He Asn Val Val 
595 600 ^ 605 

Glu Met Glu He Phe Lys Thr Ala Leu Trp Val Glu Asp Lys Cys Tyr 
610 615 620 

He Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro He 
625 630 635 640 

Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe Lys 
645 650 * 655 

Ala Ser Pro Ser Gin Asp Thr Leu He Leu Lys Leu Ala Asp Asp Phe 
660 665 " 670 

Leu He He Ser Thr Asp Gin Gin Gin Val He Asn He Lys Lys Leu 
675 680 685 

Ala Met Gly Gly Phe Gin Lys Tyr Asn Ala Lys Ala Asn Arg Asp Lys 
690 695 700 

He Leu Ala Val Ser Ser Gin Ser Asp Asp Asp Thr Val He Gin Phe 
705 710 715 720 

Cys Ala Met His He Phe Val Lys Glu Leu Glu Val Trp Lys His Ser 
725 730 735 

Ser Thr Met Asn Asn Phe His He Arg Ser Lys Ser Ser Lys Gly He 
740 745 " 750 

Phe Arg Ser Leu He Ala Leu Phe Asn Thr Arg He Ser Tyr Lys Thr 
755 760 765 

He Asp Thr Asn Leu Asn Ser Thr Asn Thr Val Leu Met Gin He Asp 
770 775 780 
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His Val 
785 



Val Lys Asn He Ser Glu Cys Tyr Lys Ser Ala Phe Lys Asp 
790 795 800 



Leu Ser 



He Asn Val Thr Gin Asn Met Gin Phe His Ser Phe Leu Gin 
805 810 815 



Arg He 



He Glu Met Thr Val Ser Gly Cys Pro He Thr Lys Cys Asp 
820 825 830 



Pro Leu 



He Glu Tyr Glu Val Arg Phe Thr He Leu Asn Gly Phe Leu 
835 ^ 840 845 



Glu Ser 
850 



Leu Ser Ser Asn Thr Ser Lys Phe Lys Asp Asn He He Leu 
855 860 



Leu Arg 
865 



Lys Glu He Gin His Leu Gin Ala Tyr He Tyr He Tyr He 
870 875 880 



His He Val Asn 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc » "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

YARACHAARG GHATYCCHYA RGG 23 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc * "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

DGTDATNARN ARRTARTCRT C 21 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
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Leu Cys Val Ser Tyr lie Leu Ser Ser Phe Tyr Tyr Ala Asn Leu Glu 
15 10 15 

Glu Asn Ala Leu Gin Phe Leu Arg Lys Glu Ser Met Asp Pro Glu Lys 
20 25 30 , 

Pro Glu Thr Asn Leu Leu Met Arg Leu Thr 
35 40 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Leu Cys Val Ser Ser lie Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu 
15 10 15 

Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro Glu Asn 
20 " 25 30 

Pro Asn Val Asn Leu Leu Met Arg Leu Thr 
35 40 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "RNA" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 12.. 25 

(D) OTHER INFORMATION: /mod_base=* OTHER 
/note= "The residues located at these positions are 
2-O-methylribonucleoti " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

TAGACCTGTT AGGUUUUGGG GUUUUG 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GGGGTTTTGG GGTTTT 16 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..389 

(D) OTHER INFORMATION: /note= "expressed sequence tag (EST) 

AA281296" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GCCAAGTTCC TGCACTGGCT GATGAGTGTG TACGTCGTCG AGCTGCTCAG GTCTTTCTTT 60 
TATGTCACGG AGACCACGTT TCAAAAGAAC AGGCTCTTTT TCTACCGGAA GAGTGTCTGG 120 
! AGCAAGTTGC AAAGCATTGG AATCAGACAG CACTTGAAGA GGGTGCAGCT GCGGGACGTG 180 

! TCGGAAGCAG AGGTCAGGCA GCATCGGGAA GCCAGGCCCG CCCTGCTGAC GTCCAGACTC 240 

CGCTTCATCC CCAAGCCTGA CGGGCTGCGG CCGATTGTGA ACATGGACTA CGTCGTGGGA 300 
GCCAGAACGT TCCGCAGAGA AAAGAGGGCC GAGCGTCTCA CCTCGAGGGT GAAGGCACTG 360 
TTCAGCGTGC TCAACTACGA GCGGGCGCG 3 89 
(2) INFORMATION FOR SEQ ID NO: 63: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..233 

(D) OTHER INFORMATION: /note* "TRT motifs from 

Schizosaccharomyces pombe tezl" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

He Ser Glu He Glu Trp Leu Val Leu Gly Lys Arg Ser Asn Ala Lys 
1 5 10 15 

Met Cys Leu Ser Asp Phe Glu Lys Arg Lys Gin He Phe Ala Glu Phe 
20 25 30 
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lie Tyr Trp Leu Tyr Asn Ser Phe He He Pro He Leu Gin Ser Phe 
35 40 45 

Phe Tyr He Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe 
50 55 60 

Arg Lys Asp He Trp Lys Leu Leu Cys Arg Pro Phe He Thr Ser Met 
65 70 75 80 

Lys Met Glu Ala Phe Glu Lys He Asn Glu Asn Asn Val Arg Met Asp 
85 90 95 

Thr Gin Lys Thr Thr Leu Pro Pro Ala Val He Arg Leu Leu Pro Lys 
100 105 110 

Lys Asn Thr Phe Arg Leu He Thr Asn Leu Arg Lys Arg Phe Leu He 
115 120 125 

Lys Met Gly Ser Asn Lys Lys Met Leu Val Ser Thr Asn Gin Thr Leu 
130 135 140 

Arg Pro Val Ala Ser He Leu Lys His Leu He Asn Glu Glu Ser Ser 
145 150 155 160 

Gly He Pro Phe Asn Leu Glu Val Tyr Met Lys Leu Leu Thr Phe Lys 
165 170 175 



Q Lys Asp Leu Leu Lys His Arg Met Phe Gly Arg Lys Lys Tyr Phe Val 

Ul. 180 185 190 

■JR 

m Arg He Asp He Lys Ser Cys Tyr Asp Arg He Lys Gin Asp Leu Met 
~ 195 200 205 

M Phe Arg He Val Lys Lys Lys Leu Lys Asp Pro Glu Phe Val He Arg 

P 210 215 220 

,T Lys Tyr Ala Thr He His Ala Thr Ser 

h ft 225 230 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..233 

(D) OTHER INFORMATION: /notes "TRT motifs from Saccharomyces 

cerevisiae EST2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Lys Asp Phe Arg Trp Leu Phe He Ser Asp He Trp Phe Thr Lys 
1 5 * 10 15 
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His Asn Phe Glu Asn Leu Asn Gin Leu Ala lie Cys Phe He Ser Trp 
20 25 30 

Leu Phe Arg Gin Leu He Pro Lys He He Gin Thr Phe Phe Tyr Cys 
35 40 45 

Thr Glu He Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp Thr 
50 55 60 

Trp Asn Lys Leu He Thr Pro Phe He Val Glu Tyr Phe Lys Thr Tyr 
65 70 75 * - 80 

Leu Val Glu Asn Asn Val Cys Arg Asn His Asn Ser Tyr Thr Leu Ser 
85 90 95 

Asn Phe Asn His Ser Lys Met Arg He He Pro Lys Lys Ser Asn Asn 
100 105 110 

Glu Phe Arg He He Ala He Pro Cys Arg Gly Ala Asp Glu Glu Glu 
115 120 125 

Phe Thr He Tyr Lys Glu Asn His Lys Asn Ala He Gin Pro Thr Gin 
130 135 140 

Lys He Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr Ser Phe Thr Lys 
145 150 * 155 160 

He Tyr Ser Pro Thr Gin He Ala Asp Arg He Lys Glu Phe Lys Gin 
165 170 175 

Arg Leu Leu Lys Lys Phe Asn Asn Val Leu Pro Glu Leu Tyr Phe Met 
180 185 190 

Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser He Pro Arg Met Glu Cys 
155 200 205 

Met Arg He Leu Lys Asp Ala Leu Lys Asn Glu Asn Gly Phe Phe Val 
210 215 220 

Arg Ser Gin Tyr Phe Phe Asn Thr Asn 
225 230 

(2) INFORMATION FOR SEQ ID NO : 65 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..233 

(D) OTHER INFORMATION: /note= "TRT motifs from Euplotes 

aediculatus pl23 n 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 65: 



48 



IlHEBSllti! JLUMIIHUJIIM! JI1B 



Thr Arg Glu lie Ser Trp Met Gin Val Glu Thr Ser Ala Lys His Phe 
15 10 15 

Tyr Tyr Phe Asp His Glu Asn lie Tyr Val Leu Trp Lys Leu Leu Arg 
20 25 30 

Trp lie Phe Glu Asp Leu Val Val Ser Leu He Arg Cys Phe Phe Tyr 
35 40 45 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 
50 55 60 

Asn He Trp Asp Val He Met Lys Met Ser He Ala Asp Leu Lys Lys 
65 70 75 80 

Glu Thr Leu Ala Glu Val Gin Glu Lys Glu Val Glu Glu Trp Lys Lys 
85 90 95 

Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu He Pro Lys Lys Thr 
100 105 110 

Thr Phe Arg Pro He Met Thr Phe Asn Lys Lys He Val Asn Ser Asp 
11S 120 125 

Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr Lys Leu Leu Asn Ser His 
130 135 140 

Leu Met Leu Lys Thr Leu Lys Asn Arg Met Phe Lys Asp Pro Phe Gly 
US 145 ^ 150 155 160 

m Phe Ala Val Phe Asn Tyr Asp Asp Val Met Lys Lys Tyr Glu Glu Phe 

?! 165 170 175 

K Val Cys Lys Trp Lys Gin Val Gly Gin Pro Lys Leu Phe Phe Ala Thr 

^ 180 185 190 



a 



W 
D 



Met Asp He Glu Lys Cys Tyr Asp Ser Val Asn Arg Glu Lys Leu Ser 
195 200 205 

Thr Phe Leu Lys Thr Thr Lys Leu Leu Ser Ser Asp Phe Trp He Met 
210 215 220 

Thr Ala Gin He Leu Lys Arg Lys Asn 
225 230 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2631 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1. .2631 

(D) OTHER INFORMATION: /note* "Saccharomyces cerevisiae 

EST2 " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 








ATTTATACTC 


ATGAAAATCT 


TATTCGAGTT 


CATTCAAGAC 


AAGCTTGACA 


TTGATCTACA 


60 


GACCAACAGT 


ACTTACAAAG 


AAAATTTAAA 


ATGTGGTCAC 


TTCAATGGCC 


TCGATGAAAT 


120 


TCTAACTACG 


TGTTTCGCAC 


TACCAAATTC 


AAGAAAAATA 


GCATTACCAT 


GCCTTCCTGG 


180 


TGACTTAAGC 


CACAAAGCAG 


TCATTGATCA 


CTGCATCATT 


TACCTGTTGA 


CGGGCGAATT 


240 


ATACAACAAC 


GTACTAACAT 


TTGGCTATAA 


AATAGCTAGA 


AATGAAGATG 


TCAACAATAG 


300 


TCTTTTTTGC 


CATTCTGCAA 


ATGTTAACGT 


TACGTTACTG 


AAAGGCGCTG 


CTTGGAAAAT 


360 


GTTCCACAGT 


TTGGTCGGTA. 


CATACGCATT 


CGTTGATTTA 


TTGATCAATT 


ATACAGTAAT 


420 


TCAATTTAAT 


GGGCAGTTTT 


TCACTCAAAT 


CGTGGGTAAC 


AGATGTAACG 


AACCTCATCT 


480 


GCCGCCCAAA 


TGGGTCCAAC 


GATCATCCTC 


ATCATCCGCA 


ACTGCTGCGC 


AAATCAAACA 


540 


ACTTACAGAA 


CCAGTGACAA 


ATAAACAATT 


CTTACACAAG 


CTCAATATAA 


ATTCCTCTTC 


600 


TTTTTTTCCT 


TATAGCAAGA 


TCCTTCCTTC 


ATCATCATCT 


ATCAAAAAGC 


TAACTGACTT 


660 


GAGAGAAGCT 


ATTTTTCCCA 


CAAATTTGGT 


TAAAATTCCT 


CAGAGACTAA 


AGGTACGAAT 


720 


TAATTTGACG 


CTGCAAAAGC 


TATTAAAGAG 


ACATAAGCGT 


TTGAATTACG 


TTTCTATTTT 


780 


GAATAGTATT 


TGCCCACCAT 


TGGAAGGGAC 


CGTATTGGAC 


TTGTCGCATT 


TGAGTAGGCA 


840 


ATCACCAAAG 


GAACGAGTCT 


TGAAATTTAT 


CATTGTTATT 


TTACAGAAGT 


TATTACCCCA 


900 


AGAAATGTTT 


GGCTCAAAGA 


AAAATAAAGG 


AAAAATTATC 


AAGAATCTAA 


ATCTTTTATT 


960 


AAGTTTACCC 


TTAAATGGCT 


ATTTACCATT 


TGATAGTTTG 


TTGAAAAAGT 


TAAGATTAAA 


1020 


GGATTTTCGG 


TGGTTGTTCA 


TTTCTGATAT 


TTGGTTCACC 


AAGCACAATT 


TTGAAAACTT 


1080 


GAATCAATTG 


GCGATTTGTT 


TCATTTCCTG 


GCTATTTAGA 


CAACTAATTC 


CCAAAATTAT 


1140 


ACAGACTTTT 


TTTTACTGCA 


CCGAAATATC 


TTCTACAGTG 


ACAATTGTTT 


ACTTTAGACA 


1200 


TGATACTTGG 


AATAAACTTA 


TCACCCCTTT 


TATCGTAGAA 


TATTTTAAGA 


CGTACTTAGT 


1260 


CGAAAACAAC 


GTATGTAGAA 


ACCATAATAG 


TTACACGTTG 


TCCAATTTCA 


ATCATAGCAA 


1320 


AATGAGGATT 


ATACCAAAAA 


AAAGTAATAA 


TGAGTTCAGG 


ATTATTGCCA 


TCCCATGCAG 


1380 


AGGGGCAGAC 


GAAGAAGAAT 


TCACAATTTA 


TAAGGAGAAT 


CACAAAAATG 


CTATCCAGCC 


1440 


CACTCAAAAA 


ATTTTAGAAT 


ACCTAAGAAA 


CAAAAGGCCG 


ACTAGTTTTA 


CTAAAATATA 


1500 


TTCTCCAACG 


CAAATAGCTG 


ACCGTATCAA 


AGAATTTAAG 


LAVjAuAL XXI 


X AAACj AAA X X 


1560 


TAATAATGTC 


TTACCAGAGC 


TTTATTTCAT 


GAAATTTGAT 


GTCAAATCTT 


GCTATGATTC 


1620 


CATACCAAGG 


ATGGAATGTA 


TGAGGATACT 


CAAGGATGCG 


CTAAAAAATG 


AAAATGGGTT 


1680 


TTTCGTTAGA 


TCTCAATATT 


TCTTCAATAC 


CAATACAGGT 


GTATTGAAGT 


TATTTAATGT 


1740 


TGTTAACGCT 


AGCAGAGTAC 


CAAAACCTTA 


TGAGCTATAC 


ATAGATAATG 


TGAGGACGGT 


1800 
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TCATTTATCA 


AATCAGGATG 


TTATAAACGT TGTAGAGATG 


GAAATATTTA 


AAACAGCTTT 


1860 


GTGGGTTGAA 


GATAAGTGCT 


ACATTAGAGA AGATGGTCTT 


TTTCAGGGCT 


CTAGTTTATC 


1920 


TGCTCCGATC 


GTTGATTTGG 


TGTATGACGA TCTTCTGGAG 


TTTTATAGCG 


AGTTTAAAGC 


1980 


CAGTCCTAGC 


CAGGACACAT 


TAATTTTAAA ACTGGCTGAC 


GATTTCCTTA 


TAATATCAAC 


2040 


AGACCAACAG 


CAAGTGATCA 


ATATCAAAAA GCTTGCCATG 


GGCGGATTTC 


AAAAATATAA 


2100 


TGCGAAAGCC 


AATAGAGACA 


AAATTTTAGC CGTAAGCTCC 


CAATCAGATG 


ATGATACGGT 


2160 


TATTCAATTT 


TGTGCAATGC 


ACATATTTGT TAAAGAATTG 


GAAGTTTGGA 


AACATTCAAG 


2220 


CACAATGAAT 


AATTTCCATA 


TCCGTTCGAA ATCTAGTAAA 


GGGATATTTC 


GAAGTTTAAT 


2280 


AGCGCTGTTT 


AACACTAGAA 


TCTCTTATAA AACAATTGAC 


ACAAATTTAA 


ATTCAACAAA 


2340 


CACCGTTCTC 


ATGCAAATTG 


ATCATGTTGT AAAGAACATT 


TCGGAATGTT 


ATAAATCTGC 


2400 


TTTTAAGGAT 


CTATCAATTA 


ATGTTACGCA AAATATGCAA 


TTTCATTCGT 


TCTTACAACG 


2460 


CATCATTGAA 


ATGACAGTCA 


GCGGTTGTCC AATTACGAAA 


TGTGATCCTT 


TAATCGAGTA 


2520 


TGAGGTACGA 


TTCACCATAT 


TGAATGGATT TTTGGAAAGC 


CTATCTTCAA 


ACACATCAAA 


2580 


ATTTAAAGAT 


AATATCATTC 


TTTTGAGAAA GGAAATTCAA 


CACTTGCAAG 


C 


2631 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..129 

(D) OTHER INFORMATION: /note= H TRT motifs from human" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu 
15 10 15 

Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu 
20 25 30 

Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gin Ser He Gly He 
35 40 45 

Arg Gin His Leu Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu 
50 55 60 

Val Arg Gin His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu 
65 70 75 80 
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Arg Phe He Pro Lys Pro Asp Gly Leu Arg Pro He Val Asn Met Asp 
85 90 95 

Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg 
100 105 110 

Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg 
115 120 125 

Ala 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

M (A) NAME/KEY: CDS 

□ (B> LOCATION: join (959 1216 , 1273.. 1353, 1425.. 1543, 

pi 1595.. 1857, 1894.. 2286, 2326. .2396, 2436.. 2705, 

*J 2746.. 2862, 2914.. 3083, 3125.. 3309, 3356.. 3504, 

Hi 3546.. 3759, 3797.. 4046, 4086.. 4252, 4296.. 4392, 

4% 4435. .4597) 

91 (W OTHER INFORMATION : /note- "Schizosaccharomyces pombe 

U telomerase catalytic subunit (TRT) n 



K 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

GGTACCGATT TACTTTCCTT TCTTCATAAG CTAATTGCTT CCTCGAACGC TCCTAAATCT 60 

\f : CTGGAAATAT TTTTACAAGA ACTCAATAAC AATACCAAGT CAAATTCCAA TATGAAGGTG 120 

TTATTAGTGA TCGATAATAT TTCTATTTTA TCGGTCGTTA CCAAGTATAA GGACAAAAAG 180 

AACAACTTCC TTCCCCCTAA AGACTTTTAC TTTATTAATT TACTTTTCAA ATATATTTCG 240 

GGTTCGCTTA CTTTTAATCG TGGTACTGTT TTAGCTGCTA CTTCTAGCCA ACCGCGTGTT 300 

TCTACCCCGT CATTGGATAT AGCTCTTGGA GTAGCTCACA GAAATCCTTA CAAATCTTCT 360 

GATGAGACTA TATTAGATTC ATTACAGTCC GTGCATATTC TTAACATGGA GCCTTACACT 420 

TTAGATGAGT CACGTCGCAT GATGGAGTAT TTGGTATCAT CCAACGTTTG CCTTGAAAAG 480 

GTTGATAATT ATTTGCAAAA TCATGTCCTT AGTGGTGGTA ATCCGCGAAA GTTTTTTGAT 540 

GCTTGCACAC GTCTAGCATG ATTGAGATAT TCAAAAATTT CTATCCACTA CAACTCCTTT 600 

AACGCGGTTT TATTTTTCTA TTTTCTATTC TCATGTTGTT CCAAATATGT ATCATCTCGT 660 

ATTAGGCTTT TTTCCGTTTT ACTCCTGGAA TCGTACCTTT TTCACTATTC CCCCTAATGA 720 

ATAATCTAAA TTAGTTTCGC TTATAATTGA TA<3TAGTAGA AAGATTGGTG ATTCTACTCG 780 
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T6TAATGTTA TTAGTTTAAA GATACTTTGC AAAACATTTA TTAGCTATCA TTATATAAAA 840 

AAAATCCTAT AATTATAAAT ATTAATCAAT ATTTGCGGTC ACTATTTATT TAAAACGTTA 900 

TGATCAGTAG GACACTTTGC ATATATATAG TTATGCTTAA TGGTTACTTG TAACTTGC 958 

ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG 1006 
Met Thr Glu His His Thr Pro Lys Ser Arg lie Leu Arg Phe Leu Glu 
1 5 io 15 

AAT CAA TAT GTA TAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT 1054 
Asn Gin Tyr Val Tyr Leu Cys Thr Leu Asn Asp Tyr Val Gin Leu Val 
20 25 * 30 

TTG AGA GGG TCG CCG GCA AGC TCG TAT AGC AAT ATA TGC GAA CGC TTG 1102 
Leu Arg Gly Ser Pro Ala Ser Ser Tyr Ser Asn He Cys Glu Arg Leu 
35 40 45 

AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT CAT TCG ACT GTA 1150 
Arg Ser Asp Val Gin Thr Ser Phe Ser He Phe Leu His Ser Thr Val 
50 55 60 

GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198 
Val Gly Phe Asp Ser Lys Pro Asp Glu Gly Val Gin Phe Ser Ser Pro 
65 70 75 80 

AAA TGC TCA CAG TCA GAG GTATATATAT TTTTGTTTTG ATTTTTTTCT 1246 
Lys Cys Ser Gin Ser Glu 
85 



gij ATTCGGGATA GCTAATATAT GGGCAG CTA ATA GCG AAT GTT GTA AAA CAG ATG 1299 

U ^eu He Ala Asn Val Val Lys Gin Met 

ir TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA CTG ATG AAA GGG TTT 1347 

Phe Asp Glu Ser Phe Glu Arg Arg Arg Asn Leu Leu Met Lys Gly Phe 
100 105 110 

TCC ATG GTAAGGTATT CTAATTGTGA AATATTTACC TGCAATTACT GTTTCAAAGA 1403 

03' Ser Met 



GATTGTATTT AACCGATAAA G AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA 1454 

Asn His Glu Asp Phe Arg Ala Met His Val 
115 120 

AAC GGA GTA CAA AAT GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA 1502 
Asn Gly Val Gin Asn Asp Leu Val Ser Thr Phe Pro Asn Tyr Leu He 
125 130 135 

TCT ATA CTT GAG TCA AAA AAT TGG CAA CTT TTG TTA GAA AT 1543 
Ser He Leu Glu Ser Lys Asn Trp Gin Leu Leu Leu Glu He 
140 145 150 

GTAAATACCG GTTAAGATGT TGCGCACTTT GAACAAGACT GACAAGTATA G T ATC 1598 

He 

GGC AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG 1646 
Gly Ser Asp Ala Met His Tyr Leu Leu Ser Lys Gly Ser He Phe Glu 
155 160 165 170 
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GCT CTT CCA AAT GAC AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT 1694 
Ala Leu Pro Asn Asp Asn Tyr Leu Gin He Ser Gly He Pro Leu Phe 
175 180 * 185 

AAA AAT AAT GTG TTT GAG GAA ACT GTG TCA AAA AAA AGA AAG CGA ACC 1742 
Lys Asn Asn Val Phe Glu Glu Thr Val Ser Lys Lys Arg Lys Arg Thr 
190 195 200 

ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA GAA GTT TCC 1790 
He Glu Thr Ser He Thr Gin Asn Lys Ser Ala Arg Lys Glu Val Ser 
205 210 215 

TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC 1838 
Trp Asn Ser He Ser He Ser Arg Phe Ser He Phe Tyr Arg Ser Ser 
220 225 230 

TAT AAG AAG TTT AAG CAA G GTAACTAATA CTGTTATCCT TCATAACTAA 1887 
Tyr Lys Lys Phe Lys Gin 
235 240 

TTTTAG AT CTA TAT TTT AAC TTA CAC TCT ATT TGT GAT CGG AAC ACA 1934 
Asp Leu Tyr Phe Asn Leu His Ser He Cys Asp Arg Asn Thr 
245 250 

h* GTA CAC A TG TGG CTT CAA TGG ATT TTT CCA AGG CAA TTT GGA CTT ATA 1982 

D Val His Trp Leu Gin Trp He Phe Pro Arg Gin Phe Gly Leu He 
255 260 265 270 

AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA TCA 2030 
Asn Ala Phe Gin Val Lys Gin Leu His Lys Val He Pro Leu Val Ser 
m 275 280 285 

f* CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT 2078 

P* Gin Ser Thr Val Val Pro Lys Arg Leu Leu Lys Val Tyr Pro Leu He 

l?_ 290 295 300 

GAA CAA ACA GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC 2126 
Glu Gin Thr Ala Lys Arg Leu His Arg He Ser Leu Ser Lys Val Tvr 
305 310 ~ 315 

AAC CAT TAT TGC CCA TAT ATT GAC ACC CAC GAT GAT GAA AAA ATC CTT 2174 
Asn His Tyr Cys Pro Tyr He Asp Thr His Asp Asp Glu Lys He Leu 
320 325 330 

AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG TTT CTT CGA TCC ATT 2222 
Ser Tyr Ser Leu Lys Pro Asn Gin Val Phe Ala Phe Leu Arg Ser He 
335 340 345 ~ 350 

CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA TTT 2270 
Leu Val Arg Val Phe Pro Lys Leu He Trp Gly Asn Gin Arg He Phe 
355 360 365 

GAG ATA ATA TTA AAA G GTATTGTATA AAATTTATTA CCACTAACGA TTTTACCAG AC 2327 
Glu He He Leu Lys Asn 
'370 P 

CTC GAA ACT TTC TTG AAA TTA TCG AGA TAC GAG TCT TTT AGT TTA CAT 2375 
Leu Glu Thr Phe Leu Lys Leu Ser Arg Tyr Glu Ser Phe Ser Leu His 
375 380 385 

TAT TTA ATG AGT AAC ATA AAG GTAATATGCC AAATTTTTTT ACCATTAATT 2426 
Tyr Leu Met Ser Asn He Lys 



Q 
11J 



54 



' 330 395 

AACAATCAG ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA AAA AGG TCA 2474 
He Ser Glu He Glu Trp Leu Val Leu Gly Lys Arg Ser 
400 405 

AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT 2522 
Asn Ala Lys Met Cys Leu Ser Asp Phe Glu Lys Arg Lys Gin He Phe 
410 415 420 

GCG GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA 2570 
Ala Glu Phe He Tyr Trp Leu Tyr Asn Ser Phe He He Pro He Leu 
425 430 435 440 

CAA TCT TTT TTT TAT ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT 2618 
Gin Ser Phe Phe Tyr He Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr 
445 450 " 455 

GTT TAT TTT AGA AAA GAT ATT TGG AAA CTC TTG TGC CGA CCC TTT ATT 2666 
Val Tyr Phe Arg Lys Asp He Trp Lys Leu Leu Cys Arg Pro Phe He 
460 465 470 

ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA ATA AAC GAG GTATTTTAAA 2715 
Thr Ser Met Lys Met Glu Ala Phe Glu Lys He Asn Glu 
H* 475 480 485 

GTATTTTTTG CAAAAAGCTA ATATTTTCAG AAC AAT GTT AGG ATG GAT ACT CAG 2769 

Asn Asn Val Arg Met Asp Thr Gin 
490 

AAA ACT ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT 2817 
Lys Thr Thr Leu Pro Pro Ala Val He Arg Leu Leu Pro Lys Lys Asn 
495 500 ~ 505 

ACC TTT CGT CTC ATT ACG AAT TTA AGA AAA AGA TTC TTA ATA AAG 2862 
Thr Phe Arg Leu He Thr Asn Leu Arg Lys Arg Phe Leu He Lys 
510 515 520 

GTATTAATTT TTGGTCATCA ATGTACTTTA CTTCTAATCT ATTATTAGCA G ATG GGT 2919 

Met Gly 
525 

TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 
Ser Asn Lys Lys Met Leu Val Ser Thr Asn Gin Thr Leu Arg Pro Val 
530 535 540 

GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA 3015 
Ala Ser He Leu Lys His Leu He Asn Glu Glu Ser Ser Gly He Pro 
545 550 555 

TTT AAC TTG GAG GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT 3063 
Phe Asn Leu Glu Val Tyr Met Lys Leu Leu Thr Phe Lys Lys Asp Leu 
560 565 570 

CTT AAG CAC CGA ATG TTT GG GTAATTATAT AATGCGCGAT TCCTCATTAT 3113 
Leu Lys His Arg Met Phe Gly 
575 580 

TAATTTTGCA G G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA AAA TCC 3161 
Arg Lys Lys Tyr Phe Val Arg He Asp He Lys Ser 
585 590 
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TST TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG 3209 
Cys Tyr Asp Arg He Lys Gin Asp Leu Met Phe Arg He Val Lys Lys 
595 600 605 

AAA CTC AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT 3257 
Lys Leu Lys Asp Pro Glu Phe Val He Arg Lys Tyr Ala Thr He His 
61 0 615 620 625 

GCA ACA AGT GAC CGA GCT ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC 3305 
Ala Thr Ser Asp Arg Ala Thr Lys Asn Phe Val Ser Glu Ala Phe Ser 
630 635 640 

TAT T GTAAGTTTAT TTTTTCATTG GAATTTTTTA ACAAATTCTT TTTTAG TT 3357 
^ Phe 

GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 
Asp Met Val Pro Phe Glu Lys Val Val Gin Leu Leu Ser Met Lys Thr 
645 650 655 

TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT 3453 
Ser Asp Thr Leu Phe Val Asp Phe Val Asp Tyr Trp Thr Lys Ser Ser 
660 665 670 ~ 675 

f»& TCT GAA ATT TTT AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT 3501 

p Ser Glu He Phe Lys Met Leu Lys Glu His Leu Ser Gly His He Val 

p 680 685 690 

U1 AAG GTATACCAAT TGTTGAATTG TAATAACACT AATGAAACTA G ATA GGA AAT 3554 

M ^ He Gly Asn 

m 695 

H TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA ATT CTG TCA 3602 

N Ser Gin Tyr Leu Gin Lys Val Gly He Pro Gin Gly Ser He Leu Ser 

700 705 " 710 



K 



TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA 3650 
Ser Phe Leu Cys His Phe Tyr Met Glu Asp Leu He Asp Glu Tyr Leu 
715 720 725 



TCG TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT 3698 
Ser Phe Thr Lys Lys Lys Gly Ser Val Leu Leu Arg Val Val Asp Asp 
'> 730 735 740 

TTC CTC TTT ATA ACA GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT 3746 
Phe Leu Phe He Thr Val Asn Lys Lys Asp Ala Lys Lys Phe Leu Asn 
745 750 755 

TTA TCT TTA AGA G GTGAGTTGCT GTCATTCCTA AGTTCTAACC GTTGAAG GA 3798 

Leu Ser Leu Arg Gl y 

760 

TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA ATA AAC 3846 
Phe Glu Lys His Asn Phe Ser Thr Ser Leu Glu Lys Thr Val He Asn 
765 770 775 780 

TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC 3894 
Phe Glu Asn Ser Asn Gly He He Asn Asn Thr Phe Phe Asn Glu Ser 
785 790 795 

AAG AAA AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT 3942 
Lys Lys Arg Met Pro Phe Phe Gly Phe Ser Val Asn Met Arg Ser Leu 
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800 



805 



810 



GAT ACA TTG TTA GCA TGT CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT 3990 
Asp Thr Leu Leu Ala Cys Pro Lys lie Asp Glu Ala Leu Phe Asn Ser 
815 820 825 

ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG AAA TCT TTT TTT TAC AAA 4038 
Thr Ser Val Glu Leu Thr Lys His Met Gly Lys Ser Phe Phe Tyr Lys 
830 835 840 

ATT CTA AG GTATACTGTG TAACTGAATA ATAGCTGACA AATAATCAG A TCG 4089 

lie Leu Arg Ser 

845 



ri 



AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA 4137 
Ser Leu Ala Ser Phe Ala Gin Val Phe lie Asp lie Thr His Asn Ser 
850 855 860 

AAA TTC AAT TCT TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT 4185 
Lys Phe Asn Ser Cys Cys Asn lie Tyr Arg Leu Gly Tyr Ser Met Cys 
865 870 875 880 

ATG AGA GCA CAA GCA TAC TTA AAA AGG ATG AAG GAT ATA TTT ATT CCC 4233 
Met Arg Ala Gin Ala Tyr Leu Lys Arg Met Lys Asp He Phe He Pro 
885 890 895 

CAA AGA ATG TTC ATA ACG G GTGAGTACTT ATTTTAACTA GAAAAGTCAT 4282 
Gin Arg Met Phe He Thr 
900 



TAATTAACCT TAG AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA 
Asp Leu Leu Asn Val He Gly Arg Lys He Trp Lys 
905 910 



4330 



p i 

m 



AAG TTG GCC GAA ATA TTA GGA TAT ACG AGT AGG CGT TTC TTG TCC TCT 4378 
Lys Leu Ala Glu He Leu Gly Tyr Thr Ser Arg Arg Phe Leu Ser Ser 
915 920 925 930 

GCA GAA GTC AAA TG GTACGTGTCG GTCTCGAGAC TTCAGCAATA TTGACACATC 4432 
Ala Glu Val Lys Trp 
935 

AG G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA CCC TCT TTC AAA 4480 
Leu Phe Cys Leu Gly Met Arg Asp Gly Leu Lys Pro Ser Phe Lys 
940 945 950 



TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 
Tyr His Pro Cys Phe Glu Gin Leu He Tyr Gin Phe Gin Ser Leu Thr 
955 960 965 



4528 



GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA 
Asp Leu He Lys Pro Leu Arg Pro Val Leu Arg Gin Val Leu Phe Leu 
970 975 980 



4576 



CAT AGA AGA ATA OCT GAT TAATGTCATT TTCAATTTAT TATATACATC 
His Arg Arg He Ala Asp 
985 



4624 



CTTTATTACT GGTGTCTTAA ACAATATTAT TACTAAGTAT AGCTGACCCC CAAAGCAAGC 4684 
ATACTATAGG ATTTCTAGTA AAGTAAAATT AATCTCGTTA TTAGTTTTGA TTGACTTGTC 4744 
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TTTATCCTTA 


TACTTTTAAG 


AAAGATTGAC 


AGTGGTTGCT GACTACTGCC CACATGCCCA 


4804 


TTAAACGGGA 


GTGGTTAAAC 


ATTAAAAGTA 


ATACATGAGG CTAATCTCCT TTCATTTAGA 


4864 


ATAAGGAAAG 


TGGTTTTCTA 


TAATGAATAA 


TGCCCGCACT AATGCAAAAA GACGAAGATT 


4924 


ATCTTCTAAA 


CAAGGGGGAT 


TAAGCATATC 


CGAAGGAAAA GAGAGTAATA TACCCAGTGT 


4984 


TGTTGAAGAA 


AGCAAGGATA 


ATTTGGAACA 


AGCTTCTGCA GATGACAGGC TAAATTTTGG 


5044 


TGACCGAATT 


TTGGTAAAAG 


CCCCAGGTTA 


TCCATGGTGG CCGGCCTTGC TACTGAGACG 


5104 


AAAAGAAACT 


AAGGATAGTT 


TGAATACTAA 


TAGCTCATTT AATGTCTTAT ATAAGGTTTT 


5164 


GTTTTTTCCT 


GACTTCAATT 


TTGCATGGGT 


GAAAAGAAAT AGTGTTAAGC CATTATTGGA 


5224 


TTCCGAAATA 


GCCAAATTTC 


TTGGTTCCTC 


AAAGCGGAAG TCTAAAGAAC TTATTGAAGC 


5284 


TTATGAGGCT 


TCAAAAACTC 


CTCCTGATTT 


AAAGGAGGAA TCTTCCACCG ATGAGGAAAT 


5344 


GGAT AGCTTA 


TCAGCTGCTG 


AGGAGAAGCC 


TAATTTTTTG CAAAAAAGAA AATATCATTG 


5404 


GGAGACATCT 


CTTGATGAAT 


CAGATGCGGA 


GAGTATCTCC AGCGGATCCT TGATGTCAAT 


5464 


AACTTCTATT 


TCTGAAATGT 


ATGGTCCTAC 


TGTCGCTTCG ACTTCTCGTA GCTCTACGCA 


5524 


GTTAAGTGAC 


CAAAGGTACC 






5544 



(2) INFORMATION FOR SEQ ID NO: 69: 



(i) SEQUENCE CHARACTERISTICS: 
; (A) LENGTH: 988 amino acids 

H* (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



pf (ii) MOLECULE TYPE: protein 

P (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

01 

Met Thr Glu His His Thr Pro Lys Ser Arg He Leu Arg Phe Leu Glu 
1 5 10 '15 

Asn Gin Tyr Val Tyr Leu Cys Thr Leu Asn Asp Tyr Val Gin Leu Val 
20 25 " 30 

Leu Arg Gly Ser Pro Ala Ser Ser Tyr Ser Asn He Cys Glu Arg Leu 
35 40 45 

Arg Ser Asp Val Gin Thr Ser Phe Ser He Phe Leu His Ser Thr Val 
50 55 60 

Val Gly Phe Asp Ser Lys Pro Asp Glu Gly Val Gin Phe Ser Ser Pro 
65 70 75 80 

Lys Cys Ser Gin Ser Glu Leu He Ala Asn Val Val Lys Gin Met Phe 
85 90 95 

Asp Glu Ser Phe Glu Arg Arg Arg Asn Leu Leu Met Lys Gly Phe Ser 
100 105 110 

Met Asn His Glu Asp Phe Arg Ala Met His Val Asn Gly Val Gin Asn 

58 



115 120 125 

Asp Leu Val Ser Thr Phe Pro Asn Tyr Leu He Ser He Leu Glu Ser 
130 135 140 

Lys Asn Tip Gin Leu Leu Leu Glu He He Gly Ser Asp Ala Met His 
145 150 155 160 

Tyr Leu Leu Ser Lys Gly Ser He Phe Glu Ala Leu Pro Asn Asp Asn 
165 170 175 

Tyr Leu Gin He Ser Gly He Pro Leu Phe Lys Asn Asn Val Phe Glu 
180 185 190 

Glu Thr Val Ser Lys Lys Arg Lys Arg Thr He Glu Thr Ser He Thr 
135 200 205 

Gin Asn Lys Ser Ala Arg Lys Glu Val Ser Trp Asn Ser He Ser He 
210 215 220 

Ser Arg Phe Ser He Phe Tyr Arg Ser Ser Tyr Lys Lys Phe Lys Gin 
225 230 235 240 

Asp Leu Tyr Phe Asn Leu His Ser He Cys Asp Arg Asn Thr Val His 
245 250 255 

Met Trp Leu Gin Trp He Phe Pro Arg Gin Phe Gly Leu He Asn Ala 
260 265 270 

Phe Gin Val Lys Gin Leu His Lys Val He Pro Leu Val Ser Gin Ser 
275 280 285 

Thr Val Val Pro Lys Arg Leu Leu Lys Val Tyr Pro Leu He Glu Gin 
290 295 300 

Thr Ala Lys Arg Leu His Arg He Ser Leu Ser Lys Val Tyr Asn His 
305 310 315 320 

Tyr Cys Pro Tyr He Asp Thr His Asp Asp Glu Lys He Leu Ser Tyr 
325 330 ~ 335 

Ser Leu Lys Pro Asn Gin Val Phe Ala Phe Leu Arg Ser He Leu Val 
340 345 ~ 350 

Arg val Phe Pro Lys Leu He Trp Gly Asn Gin Arg He Phe Glu He 
355 360 365 

He Leu Lys Asp Leu Glu Thr Phe Leu Lys Leu Ser Arg Tyr Glu Ser 
370 375 380 

Phe Ser Leu His Tyr Leu Met Ser Asn He Lys He Ser Glu He Glu 
385 390 395 400 

Trp Leu Val Leu Gly Lys Arg Ser Asn Ala Lys Met Cys Leu Ser Asp 

405 410 415 

Phe Glu Lys Arg Lys Gin He Phe Ala Glu Phe He Tyr Trp Leu Tyr 
420 425 430 

Asn Ser Phe He He Pro He Leu Gin Ser Phe Phe Tyr He Thr Glu 
435 440 445 
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Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe Arg Lys Asp He Trp 
450 455 460 

Lys Leu Leu Cys Arg Pro Phe He Thr Ser Met Lys Met Glu Ala Phe 
465 " 470 475 480 

Glu Lys He Asn Glu Asn Asn Val Arg Met Asp Thr Gin Lys Thr Thr 
485 490 495 

Leu Pro Pro Ala Val He Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg 
500 505 510 

Leu He Thr Asn Leu Arg Lys Arg Phe Leu He Lys Met Gly Ser Asn 
515 520 525 

Lys Lys Met Leu Val Ser Thr Asn Gin Thr Leu Arg Pro Val Ala Ser 
530 535 540 

He Leu Lys His Leu He Asn Glu Glu Ser Ser Gly He Pro Phe Asn 
545 550 555 560 

Leu Glu Val Tyr Met Lys Leu Leu Thr Phe Lys Lys Asp Leu Leu Lys 
565 570 575 

His Arg Met Phe Gly Arg Lys Lys Tyr Phe Val Arg He Asp He Lys 
580 * 585 590 

Ser Cys Tyr Asp Arg He Lys Gin Asp Leu Met Phe Arg He Val Lys 
595 ^ 600 605 

Lys Lys Leu Lys Asp Pro Glu Phe Val He Arg Lys Tyr Ala Thr He 
610 615 620 

His Ala Thr Ser Asp Arg Ala Thr Lys Asn Phe Val Ser Glu Ala Phe 
625 630 635 640 

Ser Tyr Phe Asp Met Val Pro Phe Glu Lys Val Val Gin Leu Leu Ser 
645 650 655 

Met Lys Thr Ser Asp Thr Leu Phe Val Asp Phe Val Asp Tyr Trp Thr 
660 665 670 

Lys Ser Ser Ser Glu He Phe Lys Met Leu Lys Glu His Leu Ser Gly 
675 680 685 

His He Val Lys He Gly Asn Ser Gin Tyr Leu Gin Lys Val Gly He 
690 695 700 

Pro Gin Gly Ser He Leu Ser Ser Phe Leu Cys His Phe Tyr Met Glu 
705 710 715 720 

Asp Leu He Asp Glu Tyr Leu Ser Phe Thr Lys Lys Lys Gly Ser Val 
725 730 735 

Leu Leu Arg Val Val Asp Asp Phe Leu Phe He Thr Val Asn Lys Lys 
740 745 750 

Asp Ala Lys Lys Phe Leu Asn Leu Ser Leu Arg Gly Phe Glu Lys His 
755 760 765 



Asn Phe Ser Thr Ser Leu Glu Lys Thr Val He Asn Phe Glu Asn Ser 
770 775 780 
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Asn Gly lie He Asn Asn Thr Phe Phe Asn Glu Ser Lys Lys Arg Met 
785 790 795 " 800 

Pro Phe Phe Gly Phe Ser Val Asn Met Arg Ser Leu Asp Thr Leu Leu 
805 810 815 

Ala Cys Pro Lys He Asp Glu Ala Leu Phe Asn Ser Thr Ser Val Glu 
820 825 830 

Leu Thr Lys His Met Gly Lys Ser Phe Phe Tyr Lys He Leu Arg Ser 
835 840 845 

Ser Leu Ala Ser Phe Ala Gin Val Phe He Asp He Thr His Asn Ser 
850 855 860 

Lys Phe Asn Ser Cys Cys Asn He Tyr Arg Leu Gly Tyr Ser Met Cys 
865 870 875 880 

Met Arg Ala Gin Ala Tyr Leu Lys Arg Met Lys Asp He Phe He Pro 
885 890 895 

Gin Arg Met Phe He Thr Asp Leu Leu Asn Val He Gly Arg Lys He 
900 905 910 

Trp Lys Lys Leu Ala Glu He Leu Gly Tyr Thr Ser Arg Arg Phe Leu 
915 920 925 

Ser Ser Ala Glu Val Lys Trp Leu Phe Cys Leu Gly Met Arg Asp Gly 
930 935 - 940 

Leu Lys Pro Ser Phe Lys Tyr His Pro Cys Phe Glu Gin Leu He Tyr 
945 950 955 960 

Gin Phe Gin Ser Leu Thr Asp Leu He Lys Pro Leu Arg Pro Val Leu 
965 970 975 

Arg Gin Val Leu Phe Leu His Arg Arg He Ala Asp 
980 985 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = guanosine modified by 
bio tin group" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
NCCTATTTYT TYTAYNNNAC NGA 
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(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Phe Phe Tyr Xaa Thr Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CCAGATATNA DNARRAARTC RTC 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /product* "OTHER" 

/note* "Xaa = Phe, He or Leu" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Asp Asp Phe Leu Xaa He 
1 5 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
ACAATGMGNH TNHTNCCNAA RAA 23 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2. ,3 

(D) OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa = Leu or He" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Arg Xaa Xaa Pro Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
ACGAATCKNG GDATNSWRTC RTARCA 26 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Cys Tyr Asp Ser He Pro Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CAATTCTCRT ARCANSWYTT DATRTC 26 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

p (ii) MOLECULE TYPE: peptide 

a 

Ul (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Q ? | Asp lie Lys Ser Cys Tyr Asp 

U 1 5 

s (2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

GATTACTCCC GAAGAAAGGA TCTTTCCGTC CAATCATGAC TTTCTTAAGA AAGGACAAGC 60 

AAAAAAATAT TAAGTTAAAT CTAAATTAAA TTCTAATGGA TAGCCAACTT GTGTTTAGGA 120 

ATTTAAAAGA CATGCTGGGA TAAAAGATAG GATACTCAGT CTTTGATAAT AAACAAATTT 180 

CAGAAAAATT TGCCTAATTC ATAGAGAAAT GGAAAAATAA AGGAAGACCT CAGCTATATT 240 

ATGTCACTCT AGACATAAAG ACTTGCTAC 269 
(2) INFORMATION FOR SEQ ID NO:81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

AAACACAAGG AAGGAAGTCA AATATTCTAT TACCGTAAAC CAATATGGAA ATTAGTGAGT 60 

AAATTAACTA TTGTCAAAGT AAGAATTTAG TTTTCTGAAA AGAATAAATA AATGAAAAAT 120 

AATTTTTATC AAAAAATTTA GCTTGAAGAG GAGAATTTGG AAAAAGTTGA AGAAAAATTG 180 

ATACCAGAAG ATTCATTTTA GAAATACCCT CAAGGAAAGC TAAGGATTAT ACCTAAAAAA 240 

GGATCTTTCC GTCCAATCAT GACTTTCTTA AGAAAGGACA AGCAAAAAAA TATTAAGTTA 300 
AATCTAAATT AAATTCTAAT GGATAGCCAA CTTGTGTTTA GGAATTTAAA AGACATGCTG . 360 

GGATAAAAGA TAGGATACTC AGTCTTTGAT AATAAACAAA TTTCAGAAAA ATTTGCCTAA 420 

TTCATAGAGA AATGGAAAAA TAAAGGAAGA CCTCAGCTAT ATTATGTCAC TCTA 474 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Lys His Lys Glu Gly Ser Gin lie Phe Tyr Tyr Arg Lys Pro lie Trp 
15 10 15 

Lys Leu Val Ser Lys Leu Thr lie Val Lys Val Arg lie Gin Phe Ser 
20 25 30 

Glu Lys Asn Lys Gin Met Lys Asn Asn Phe Tyr Gin Lys lie Gin Leu 
35 40 45 

Glu Glu Glu Asn Leu Glu Lys Val Glu Glu Lys Leu lie Pro Glu Asp 
50 55 60 

Ser Phe Gin Lys Tyr Pro Gin Gly Lys Leu Arg lie lie Pro Lys Lys 
65 70 75 80 

Gly Ser Phe Arg Pro lie Met Thr Phe Leu Arg Lys Asp Lys Gin Lys 
85 90 95 

Asn lie Lys Leu Asn Leu Asn Gin lie Leu Met Asp Ser Gin Leu Val 
100 105 110 

Phe Arg Asn Leu Lys Asp Met Leu Gly Gin Lys lie Gly Tyr Ser Val 
115 120 125 

Phe Asp Asn Lys Gin lie Ser Glu Lys Phe Ala Gin Phe lie Glu Lys 
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130 135 140 

Trp Lys Asn Lys Gly Arg Pro Gin Leu Tyr Tyr Val Thr Leu 
145 " 150 155 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Phe Phe Tyr Cys Thr Glu lie Ser Ser Thr Val Thr lie Val Tyr Phe 
1 5 10 15 

Arg His Asp Thr Trp Asn Lys Leu He Thr Pro Phe He Val Glu Tyr 
20 25 30 



Phe Lys Thr Tyr Leu Val Glu Asn Asn Val Cys Arg Asn His Asn Ser 
35 40 45 



□ 

111 Tyr Thr Leu Ser Asn Phe Asn His Ser Lys Met Arg He He Pro Lys 

4S 50 55 60 

m 

Lys Ser Asn Asn Glu Phe Arg He He Ala He Pro Cys Arg Gly Ala 
65 70 75 80 

Asp Glu Glu Glu Phe Thr He Tyr Lys Glu Asn His Lys Asn Ala He 
m 85 90 95 

Gin Pro Thr Gin Lys He Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr 
100 105 110 



Ser Phe Thr Lys He Tyr Ser Pro Thr Gin He Ala Asp Arg He Lys 
115 120 125 

Glu Phe Lys Gin Arg Leu Leu Lys Lys Phe Asn Asn Val Leu Pro Glu 
130 135 140 

Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
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Phe Phe Tyr Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr 
15 10 15 

Tyr Arg Lys Asn lie Trp Asp Val lie Met Lys Met Ser lie Ala Asp 
20 * 25 30 

Leu Lys Lys Glu Thr Leu Ala Glu Val Gin Glu Lys Glu Val Glu Glu 
35 40 45 

Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu lie Pro 
50 55 60 

Lys Lys Thr Thr Phe Arg Pro lie Met Thr Phe Asn Lys Lys lie Val 
65 70 75 80 

Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr Lys Leu Leu 
85 90 95 

Asn Ser His Leu Met Leu Lys Thr Leu Lys Asn Arg Met Phe Lys Asp 
100 105 110 

Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp Asp Val Met Lys Lys Tyr 
115 120 125 

Glu Glu Phe Val Cys Lys Trp Lys Gin Val Gly Gin Pro Lys Leu Phe 
130 135 140 



Phe Ala Thr Met Asp lie Glu Lys Cys Tyr Asp 
145 150 ~ 155 



INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Lys His Lys Glu Gly Ser Gin lie Phe Tyr Tyr Arg Lys Pro lie Trp 
1 5 10 15 

Lys Leu Val Ser Lys Leu Thr He Val Lys Val Arg He Gin Phe Ser 
20 25 30 

Glu Lys Asn Lys Gin Met Lys Asn Asn Phe Tyr Gin Lys He Gin Leu 
35 40 45 

Glu Glu Glu Asn Leu Glu Lys Val Glu Glu Lys Leu He Pro Glu Asp 
50 55 60 

Ser Phe Gin Lys Tyr Pro Gin Gly Lys Leu Arg He He Pro Lys Lys 
65 70 75 80 

Gly Ser Phe Arg Pro He Met Thr Phe Leu Arg Lys Asp Lys Gin Lys 
85 90 95 
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Asn lie Lys Leu Asn Leu Asn Gin lie Leu Met Asp Ser Gin Leu Val 
100 105 no 

Phe Arg Asn Leu Lys Asp Met Leu Gly Gin Lys lie Gly Tyr Ser Val 
115 120 * 125 

Phe Asp Asn Lys Gin He Ser Glu Lys Phe Ala Gin Phe He Glu Lys 
130 135 140 

Trp Lys Asn Lys Gly Arg Pro Gin Leu Tyr Tyr Val Thr Leu 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Glu Val Asp Val Asp Asn Gin Ala Asp Asn His Gly He His Ser Ala 
1 5 10 15 



Leu Lys Thr Cys Glu Glu He Lys Glu Ala Lys Thr Leu Tyr Ser Trp 

31 20 25 30 



L He Gin Lys Val He Arg Cys Arg Asn Gin Ser Gin Ser His Tyr Lys 

35 40 45 



Asp Leu Glu Asp He Lys He Phe Ala Gin Thr Asn He Val Ala Thr 
50 55 60 



P Arg Asp Tyr Asn Glu Glu Asp Phe Lys Val He Ala Arg Lys Glu 

JjM 65 70 75 80 

Q 

m Val Phe Ser Thr Gly Leu Met He Glu Leu He Asp Lys Cys Leu Val 

85 90 95 

Glu Leu Leu Ser Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin Cys 
100 105 110 

Phe Gly Phe Gin Leu Lys Gly Asn Gin Leu Ala Lys Thr His Leu Leu 
115 120 125 

Thr Ala Leu Ser Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp Asn 
130 135 140 

Gin Val Arg Ala Met He Gly Asn Glu Leu Phe Arg His Leu Tyr Thr 
145 150 155 " 160 

Lys Tyr Leu He Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin Phe 
165 170 175 

Cys Gly Asn Asn Val Phe Asp His Leu Lys Val Asn Asp Lys Phe Asp 
180 185 190 
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Lys Lys Gin Lys Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys Cys 
195 200 205 

Ser Thr Cys Lys Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu Asn 
210 " ' 215 220 

Asn lie Asn Val Pro Asn Trp Asn Asn Met Lys Ser Arg Thr Arg lie 
225 230 235 240 

Phe Tyr Cys Thr His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys His 
245 " 250 255 

Glu Phe Val Ser Asn Lys Asn Asn lie Ser Ala Met Asp Arg Ala Gin 
260 265 270 

Thr He Phe Thr Asn He Phe Arg Phe Asn Arg He Arg Lys Lys Leu 
275 280 285 

Lys Asp Lys Val He Glu Lys He Ala Tyr Met Leu Glu Lys Val Lys 
290 295 300 

Asp Phe Asn Phe Asn Tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro Glu 
305 310 * 315 320 

Asn Trp Arg Glu Arg Lys Gin Lys He Glu Asn Leu He Asn Lys Thr 
325 330 335 

Arg Glu- Glu Lys Ser Lys Tyr Tyr Glu Glu Leu Phe Ser Tyr Thr Thr 
340 345 350 

Asp Asn Lys Cys Val Thr Gin Phe He Asn Glu Phe Phe Tyr Asn He 
355 360 365 

Leu Pro Lys Asp Phe Leu Thr Gly Arg Asn Arg Lys Asn Phe Gin Lys 
370 375 380 

Lys Val Lys Lys Tyr Val Glu Leu Asn Lys His Glu Leu He His Lys 
385 ^ 390 395 400 

Asn Leu Leu Leu Glu Lys He Asn Thr Arg Glu He Ser Trp Met Gin 
405 410 415 

Val Glu Thr Ser Ala Lys His Phe Tyr Tyr Phe Asp His Glu Asn He 
420 425 430 

Tyr Val Leu Trp Lys Leu Leu Arg Trp He Phe Glu Asp Leu Val Val 
435 440 445 

Ser Leu He Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin Lys Ser Tyr 
450 ~ 455 460 

Ser Lys Thr Tyr Tyr Tyr Arg Lys Asn He Trp Asp Val He Met Lys 
465 470 475 480 

Met Ser He Ala Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin Glu 
485 490 495 

Lys Glu Val Glu Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys 
500 505 510 

Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arg Pro He Met Thr Phe 
515 520 525 
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Asn Lys Lys He Val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr 
530 535 540 

Asn Thr Lys Leu Leu Asn Ser His Leu Met Leu Lys Thr Leu Lys Asn 
545 550 555 560 

Arg Met Phe Lys Asp Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp Asp 
565 570 575 

Val Met Lys Lys Tyr Glu Glu Phe Val Cys Lys Trp Lys Gin Val Gly 
580 585 590 

Gin Pro Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp 
595 600 605 

Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys Leu 
610 615 620 

Leu Ser Ser Asp Phe Trp He Met Thr Ala Gin He Leu Lys Arg Lys 
625 630 635 640 

Asn Asn He Val He Asp Ser Lys Asn Phe Arg Lys Lys Glu Met Lys 
645 650 655 

Asp Tyr Phe Arg Gin Lys Phe Gin Lys He Ala Leu Glu Gly Gly Gin 
660 665 670 

Tyr Pro Thr Leu Phe Ser Val Leu Glu Asn Glu Gin Asn Asp Leu Asn 
675 680 685 

Ala Lys Lys Thr Leu He Val Glu Ala Lys Gin Arg Asn Tyr Phe Lys 
690 695 700 

Lys Asp Asn Leu Leu Gin Pro Val He Asn He Cys Gin Tyr Asn Tyr 
705 " 710 715 720 

He Asn Phe Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly He Pro Gin 
725 730 735 

Gly Leu Cys Val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr Leu 
740 745 750 

Glu Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro Glu 
755 760 765 

Asn Pro Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu 
770 775 780 

He Thr Thr Gin Glu Asn Asn Ala Val Leu Phe He Glu Lys Leu He 
785 790 795 800 

Asn Val Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu Gin 
805 810 815 

Thr Ser Phe Pro Leu Ser Pro Ser Lys Phe Ala Lys Tyr Gly Met Asp 
820 825 830 

Ser Val Glu Glu Gin Asn lie Val Gin Asp Tyr Cys Asp Trp He Gly 
835 840 845 

He Ser He Asp Met Lys Thr Leu Ala Leu Met Pro Asn He Asn Leu 
850 855 860 
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Arg lie Glu Gly lie Leu Cys Thr Leu Asn Leu Asn Met Gin Thr Lys 
865 870 875 880 

Lys Ala Ser Met Trp Leu Lys Lys Lys Leu Lys Ser Phe Leu Met Asn 
885 ~ 

Asn lie Thr His Tyr Phe Arg Lys Thr lie Thr Thr Glu Asp Phe Ala 
900 905 910 

Asn Lys Thr Leu Asn Lys Leu Phe lie Ser Gly Gly Tyr Lys Tyr Met 
915 " 920 * 925 

Gin Cys Ala Lys Glu Tyr Lys Asp His Phe Lys Lys Asn Leu Ala Met 
930 935 940 

Ser Ser Met lie Asp Leu Glu Val Ser Lys lie lie Tyr Ser Val Thr 
945 950 955 960 

Arg Ala Phe Phe Lys Tyr Leu Val Cys Asn lie Lys Asp Thr lie Phe 
965 970 975 

Gly Glu Glu His Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His Phe 
980 985 990 

lie Glu lie Phe Ser Thr Lys Lys Tyr lie Phe Asn Arg Val Cys 
995 1000 1005 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GTGAAGGCAC TGTTCAGCG 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

GTGGATGATT TCTTGTTGG 

(2) INFORMATION FOR SEQ ID NO:89: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
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(B) TYPE: nucleic acid 

(C) S TRANDEDNE S S : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
ATGCTCCTGC GTTTGGTGG 



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
CTGGACACTC AGCCCTTGG 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 
GGCAGGTGTG CTGGACACT 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 
TTTGATGATG CTGGCGATG 



(2) INFORMATION FOR SEQ ID NO: 93: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 19 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 
GGGGCTCGTC TTCTACAGG 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 
CAGCAGGAGG ATCTTGTAG 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 
TGACCCCAGG AGTGGCACG 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 
TCAAGCTGAC TCGACACCG 



(2) INFORMATION FOR SEQ ID NO: 97: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
CGGCGTGACA GGGCTGC 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
GCTGAAGGCT GAGTGTCC 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
TAGTCCATGT TCACAATCG 



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 22.. 1716 

(D) OTHER INFORMATION: /note= "EcoRI-NotI insert of 

clone 712562 encoding 63 kDa 
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telomerase protein" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

GCCAAGTTCC TGCACTGGCT GATGAGTGTG TACGTCGTCG AGCTGCTCAG GTCTTTCTTT 60 

TATGTCACGG AGACCACGTT TCAAAAGAAC AGGCTCTTTT TCTACCGGAA GAGTGTCTGG 120 

AGCAAGTTGC AAAGCATTGG AATCAGACAG CACTTGAAGA GGGTGCAGCT GCGGGAGCTG 180 

TCGGAAGCAG AGGTCAGGCA GCATCGGGAA GCCAGGCCCG CCCTGCTGAC GTCCAGACTC 240 

CGCTTCATCC CCAAGCCTGA CGGGCTGCGG CCGATTGTGA ACATGGACTA CGTCGTGGGA 300 

GCCAGAACGT TCCGCAGAGA AAAGARGGCC GAGCGTCTCA CCTCGAGGGT GAAGGCACTG 360 

TTCAGCGTGC TCAACTACGA GCGGGCGCGG CGCCCCGGCC TCCTGGGCGC CTCTGTGCTG 420 

GGCCTGGACG ATATCCACAG GGCCTGGCGC ACCTTCGTGC TGCGTGTGCG GGCCCAGGAC 480 

CCGCCGCCTG AGCTGTACTT TGTCAAGGTG GATGTGACGG GCGCGTACGA CACCATCCCC 540 

CAGGACAGGC TCACGGAGGT CATCGCCAGC ATCATCAAAC CCCAGAACAC GTACTGCGTG 600 

b CGTCGGTATG CCGTGGTCCA GAAGGCCGCC ATGGGCACGT CCGCAAGGCC TTCAAGAGCC 660 

P 

0 ACGTCCTACG TCCAGTGCCA GGGGATCCCG CAGGGCTCCA TCCTCTCCAC GCTGCTCTGC 720 

Hi AGCCTGTGCT ACGGCGACAT GGAGAACAAG CTGTTTGCGG GGATTCGGCG GGACGGGCTG 780 

Bl CTCCTGCGTT TGGTGGATGA TTTCTTGTTG GTGACACCTC ACCTCACCCA CGCGAAAACC 840 

Uft 

^ TTCCTCAGGA CCCTGGTCCG AGGTGTCCCT GAGTATGGCT GCGTGGTGAA CTTGCGGAAG 900 

f - ACAGTGGTGA ACTTCCCTGT AGAAGACGAG GCCCTGGGTG GCACGGCTTT TGTTCAGATG 960 

M 

CCGGCCCACG GCCTATTCCC CTGGTGCGGC CTGCTGCTGG ATACCCGGAC CCTGGAGGTG 1020 

CAGAGCGACT ACTCCAGCTA TGCCCGGACC TCCATCAGAG CCAGTCTCAC CTTCAACCGC 1080 

GGCTTCAAGG CTGGGAGGAA CATGCGTCGC AAACTCTTTG GGGTCTTGCG GCTGAAGTGT 1140 

CACAGCCTGT TTCTGGATTT GCAGGTGAAC AGCCTCCAGA CGGTGTGCAC CAACATCTAC 1200 

AAGATCCTCC TGCTGCAGGC GTACAGGTTT CACGCATGTG TGCTGCAGCT CCCATTTCAT 1260 

CAGCAAGTTT GGAAGAACCC ACATTTTTCC TGCGCGTCAT CTCTGACACG GCTCCCTCTG 1320 

CTACTCCATC CTGAAAGCCA AGAACGCAGG GATGTCGCTG GGGGCCAAGG GCGCCGCCGG 1380 

CCCTCTGCCC TCCGAGGCCG TGCAGTGGCT GTGCCACCAA GCATTCCTGC TCAAGCTGAC 1440 

TCGACACCGT GTCACCTACG TGCCACTCCT GGGGTCACTC AGGACAGCCC AGACGCAGCT 1500 

GAGTCGGAAG CTCCCGGGGA CGACGCTGAC TGCCCTGGAG GCCGCAGCCA ACCCGGCACT 1560 

GCCCTCAGAC TTCAAGACCA TCCTGGACTG ATGGCCACCC GCCCACAGCC AGGCCGAGAG 1620 

CAGACACCAG CAGCCCTGTC ACGCCGGGCT TATACGTCCC AGGGAGGGAG GGGCGGCCCA 1680 

CACCCAGGCC TGCACCGCTG GGAGTCTGAG GCCTGAGTGA GTGTTTGGCC GAGGCCTGCA 1740 

75 



TGTCCGGCTG AAGGCTGAGT GTCCGGCTGA GGCCTGAGCG AGTGTCCAGC CAAGGGCTGA 1800 

GTGTCCAGCA CACCTGCGTT TTCACTTCCC CACAGGCTGG CGTTCGGTCC ACCCCAGGGC 1860 

CAGCTTTTCC TCACCAGGAG CCCGGCTTCC ACTCCCCACA TAGGAATAGT CCATCCCCAG 1920 

ATTCGCCATT GTTCACCCTT CGCCCTGCCT TCCTTTGCCT TCCACCCCCA CCATTCAGGT 1980 

GGAGACCCTG AGAAGGACCC TGGGAGCTTT GGGAATTTGG AGTGACCAAA GGTGTGCCCT 2040 

GTACACAGGC GAGGACCCTG CACCTGGATG GGGGTCCCTG TGGGTCAAAT TGGGGGGAGG 2100 

TGCTGTGGGA GTAAAATACT GAATATATGA GTTTTTCAGT TTTGGAAAAA AAAAAAAAAA 2160 

AAAAAAAAAA A 2171 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME /KEY: Protein 

(B) LOCATION: 1..564 

(D) OTHER INFORMATION: /note= "63 kDa telomerase protein 

encoded by ORF of EcoRI-NotI insert of 
clone 712562" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr 
1 5 10 15 

Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val 
20 25 30 

Trp Ser Lys Leu Gin Ser lie Gly lie Arg Gin His Leu Lys Arg Val 
35 40, 45 

Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His Arg Glu Ala 
50 55 60 

Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro Lys Pro Asp 
65 70 75 80 

Gly Leu Arg Pro lie Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr 
85 90 95 

Phe Arg Arg Glu Lys Xaa Ala Glu Arg Leu Thr Ser Arg Val Lys Ala 
100 " 105 110 

Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu 
115 120 125 
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Gly Ala Ser Val Leu Gly Leu Asp Asp lie His Arg Ala Trp Arg Thr 
130 135 140 

Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro Glu Leu Tyr Phe 
145 150 155 160 

Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr lie Pro Gin Asp Arg 
165 170 175 

Leu Thr Glu Val lie Ala Ser lie lie Lys Pro Gin Asn Thr Tyr Cys 
180 185 190 

Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala Met Gly Thr Ser Ala 
195 200 205 

Arg Pro Ser Arg Ala Thr Ser Tyr Val Gin Cys Gin Gly lie Pro Gin 
210 215 220 

Gly Ser lie Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met 
225 230 235 * * ~ 

Glu Asn Lys Leu Phe Ala Gly lie Arg Arg Asp Gly Leu Leu Leu Arg 
245 250 255 

Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys 
260 265 270 

Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val 
275 280 285 

Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala 
290 295 300 

Leu Gly Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe Pro 
305 310 315 320 

■ Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser Asp 
325 330 335 

Tyr Ser Ser Tyr Ala Arg Thr Ser lie Arg Ala Ser Leu Thr Phe Asn 
340 345 350 

Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val 
355 360 365 

Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn Ser 
370 375 380 

Leu Gin Thr Val Cys Thr Asn lie Tyr Lys lie Leu Leu Leu Gin Ala 
385 390 395 400 

Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin Val 
405 410 415 

Trp Lys Asn Pro His Phe Ser Cys Ala Ser Ser Leu Thr Arg Leu Pro 
420 425 430 

Leu Leu Leu His Pro Glu Ser Gin Glu Arg Arg Asp Val Ala Gly Gly 
435 440 445 



Gin Gly Arg Arg Arg Pro Ser Ala Leu Arg Gly Arg Ala Val Ala Val 
450 455 460 
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Pro Pro Ser lie Pro Ala Gin Ala Asp Ser Thr Pro Cys His Leu Arg 
465 470 475 480 

Ala Thr Pro Gly Val Thr Gin Asp Ser Pro Asp Ala Ala Glu Ser Glu 
485 490 495 

Ala Pro Gly Asp Asp Ala Asp Cys Pro Gly Gly Arg Ser Gin Pro Gly 
500 505 510 

Thr Ala Leu Arg Leu Gin Asp His Pro Gly Leu Met Ala Thr Arg Pro 
515 520 525 

Gin Pro Gly Arg Glu Gin Thr Pro Ala Ala Leu Ser Arg Arg Ala Tyr 
530 535 540 

Thr Ser Gin Gly Gly Arg Gly Gly Pro His Pro Gly Leu His Arg Trp 
545 - 550 555 560 

Glu Ser Glu Ala 



(2) INFORMATION FOR SEQ ID NO: 102: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

m : 

H (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

CCAGTGAGCA GAGTGACGAG GACTCGAGCT CAAGCTTTTT titTTTTTTT 50 

M 

K (2) INFORMATION FOR SEQ ID NO: 103: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(i) 



Q 



0 



(i) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CCAGTGAGCA GAGTGACG 18 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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• (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
GAGGACTCGA GCTCAAGC 18 
<2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
CACTGATCCT TTCTTTTTCG TAAACGATAG GT 32 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
. CATCAATCAA ATCTTCCATA TAGAAATGAC A 31 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = 5' -phosphorylated guanosine" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
NGGCCGTGTT GGCCTAGTTC TCTGCTC 27 

(2) INFORMATION FOR SEQ ID NO: 108: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 
GAGGAGGAGA AGAGCAGAGA ACTAGGCCAA CACGCCCC 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 
GTGTCATTTC TATATGGAAG ATTTGATTGA TG 
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:110 
ACCTATCGTT TACGAAAAAG AAAGGATCAG TG 
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 
GAGTGACATA ATATACGTGA 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 



80 



La. 

o 



(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Phe Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe 
15 10 15 

Tyr Arg Lys Ser Val Trp Ser Lys 
20 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 



ill Arg Gin His Leu Lys Arg Val Gin Leu Arg Asp Val Ser Glu Ala Glu 

$ 1 5 10 15 

f ! Val Arg Gin His Arg Glu Ala 

r 20 

k (2) INFORMATION FOR SEQ ID NO: 114: 

M (i) SEQUENCE CHARACTERISTICS: 
H (A) LENGTH: 27 amino acids 

. pj (B) TYPE: amino acid 

p*/ (C) STRANDEDNESS: 

5* (D) TOPOLOGY: linear 

•ru 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg 
1 5 10 15 

Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu 
15 10 15 

Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gin 
20 25 



(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly 
1 5 10 15 

lie Arg Gin His Leu Lys Arg Val Gin Leu Arg Asp Val Ser 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe He Pro Lys Pro Asp Gly 
15 10 15 

Leu Arg Pro He Val Asn Met Asp Tyr Val Val 
20 25 



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
YARACHAARG GHATYCCHYA RGG 



23 



o 

Hi 
flj 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Gin Thr Lys Gly lie Pro Gin Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



U1 (D) TOPOLOGY: linear 



gi- (ii) MOLECULE TYPE: DNA 

M (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

NGTNATDARD ARRTARTCRT C 21 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121; 

Asp Asp Tyr Leu Leu lie Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

Lys Gly lie Pro Gin Gly Leu Cys Val Ser Ser lie Leu Ser Ser Phe 
15 10 15 

Tyr Tyr Ala Thr Leu Glu Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu 
20 25 30 

Ser Met Asia Pro Glu Asn Pro Asn Val Asn Leu Leu Met Arg Leu Thr 
35 40 45 

Asp Asp Tyr Leu Leu lie Thr 
50 55 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Ser lie Leu Ser Ser Phe Leu Cys His Phe Tyr Met Glu Asp Leu lie 
15 10 15 

Asp Glu Tyr Leu Ser Phe Thr Lys Lys Lys Gly Ser Val Leu Leu Arg 
20 25 30 

Val Val 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE:, amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro lie Val Asp Leu 
15 10 15 

Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe Lys Ala Ser Pro 
20 25 30 

Ser Gin Asp Thr Leu lie Leu Lys Leu Ala Asp Asp Phe Leu lie lie 
35 40 45 

Ser 
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(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Gin Lys Val Gly He Pro Gin Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



jg (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

m 



CAAAAAGTTG GTATCCCTCA GGG 23 
(2) INFORMATION FOR SEQ ID NO: 127: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 base pairs 

(B) TYPE: nucleic acid 
H < c > STRANDEDNESS: single 
m (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

AGACCAAAGG AATTCCATCA GGCTCAATTC TGTCATCTTT TTTGTGTCAT TTCTATATGG 60 

AAGATTTGAT TGATGAATAC CTATCGTTTA CGAAAAAGAA AGGATCAGTG TTGTTACGAG 120 

TAGTCGACGA CTACCTCCTC ATCACC 146 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Lys Gly He Pro Ser Gly Ser He Leu Ser Ser Phe Leu Cys His Phe 
15 10 15 

Tyr Met Glu Asp Leu He Asp Glu Tyr Leu Ser Phe Thr Lys Lys Lys 
20 25 30 

Gly Ser Val Leu Leu^Arg Val Val Asp Asp Tyr Leu Leu He Thr 
35 40 * 45 



(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 
GACGATTTCC TCTTTATAAC A 



(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Asp Asp Phe Leu Phe He Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear * 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
AAAAAAAAAA AAAAAA 

(2) INFORMATION FOR SEQ ID NO: 132: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
TTTTTTTTTT TTTTTTT 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..35 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Schizosaccharomyces pombe tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 3: 

Trp Leu Tyr Asn Ser Phe lie lie Pro lie Leu Gin Ser Phe Phe Tyr 
15 10 15 

He Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe Arg Lys 
20 25 30 

Asp He Trp 
35 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "motif 1 and 2 peptide from 

Schizosaccharomyces pombe tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

Ala Val lie Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg Leu lie Thr 
1 5 10 15 
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Asn Leu Arg Lys Arg Phe 
20 



(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..26 

(D) OTHER INFORMATION: /note= "motif 3(A) peptide from 

Schizosaccharomyces pombe tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

Lys Lys Tyr Phe Val Arg He Asp He Lys Ser Cys Tyr Asp Arg He 
1 5 10 15 

Lys Gin Asp Leu Met Phe Arg He Val Lys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 136 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1. .32 

(D) OTHER INFORMATION: /note= "motif 4 (B' ) peptide from 

Schizosaccharomyces pombe tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Tyr Leu Gin Lys Val Gly He Pro Gin Gly Ser He Leu Ser Ser Phe 
15 10 15 

Leu Cys His Phe Tyr Met Glu Asp Leu He Asp Glu Tyr Leu Ser Phe 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 13 7: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 49 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .49 

(D) OTHER INFORMATION: /note- "motif 5(C) and 6(D) peptide from 

Schizosaccharomyces pombe tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

Val Leu Leu Arg Val Val Asp Asp Phe Leu Phe He Thr Val Asn Lys 
1 5 10 15 

Lys Asp Ala Lys Lys Phe Leu Asn Leu Ser Leu Arg Gly Phe Glu Lys 
20 25 30 

His Asn Phe Ser Thr Ser Leu Glu Lys Thr Val He Asn Phe Glu Asn 
35 40 45 

Ser 



(2) INFORMATION FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..34 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

Trp Leu Phe Arg Gin Leu He Pro Lys He He Gin Thr Phe Phe Tyr 
15 10 15 

Cys Thr <31u He Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp 
20 25 30 

Thr Trp 



(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .25 

(D) OTHER INFORMATION: /note= "motif 1 and 2 peptide from 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:139: 

Ser Lys Met Arg He He Pro Lys Lys Ser Asn Asn Glu Phe Arg He 
1 5 10 15 

He Ala He Pro Cys Arg Gly Ala Asp 
20 25 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 
5f (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



r 5 



5s? I 



m 



Q 
fli 



(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .26 

(D) OTHER INFORMATION: /note= "motif 3(A) peptide from 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser He 
15 10 15 

Pro Arg Met Glu Cys Met Arg He Leu Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 
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(B) LOCATION: 1..32 

(D) OTHER INFORMATION: /note= "motif 4{B') peptide from 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

Tyr He Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro 
1 5 10 15 

He Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe 
20 25 30 



) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY: Peptide 

(B) LOCATION: 1..49 

(D) OTHER INFORMATION: /note= "motif 5(C) peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

Leu He Leu Lys Leu Ala Asp Asp Phe Leu He He Ser Thr Asp Gin 
1 5 10 is 

Gin Gin Val He Asn He Lys Lys Leu Ala Met Gly Gly Phe Gin Lys 
20 25 30 

Tyr Asn Ala Lys Ala Asn Arg Asp Lys He Leu Ala Val Ser Ser Gin 
35 40 45 

Ser 



INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..35 

. (D) OTHER INFORMATION: /note= "motif 0 peptide from 

Euplotes aediculatus pl23" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

Trp He Phe Glu Asp Leu Val Val Ser Leu He Arg Cys Phe Phe Tyr 
15 10 15 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 
20 25 30 

Asn He Trp 
35 



INFORMATION FOR SEQ ID NO : 144 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .23 

(D) OTHER INFORMATION: /note= "motif 1 and 2 peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

Gly Lys Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arg Pro He Met 
15 10 15 

Thr Phe Asn Lys Lys He Val 
20 



INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..26 

(D) OTHER INFORMATION: /note= "motif 3(A) peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp Ser Val 
1 5 10 * 15 



92 



Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys 
20 25 



INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
<B) LOCATION: 1..32 

(D) OTHER INFORMATION: /note= "motif 4 (B' ) peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

Tyr Lys Gin Thr Lys Gly lie Pro Gin Gly Leu Cys Val Ser Ser He 
15 10 15 

Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu Glu Ser Ser Leu Gly Phe 
20 25 30 



INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..49 

(D) OTHER INFORMATION: /note= "motif 5(C) and 6(D) peptide from 

Euplotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu He Thr Thr Gin Glu 
15 10 15 

Asn Asn Ala Val Leu Phe He Glu Lys Leu He Asn Val Ser Arg Glu 
20 25 30 

Asn Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr Ser Phe Pro Leu 
35 40 45 

Ser 
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(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .22 

(D) OTHER INFORMATION: /note= "motif 1 peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

Leu Val Val Ser Leu He Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin 
15 10 15 

Lys Ser Tyr Ser Lys Thr 
20 



(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .30 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu He Pro Lys Lys 
15 10 15 

Thr Thr Phe Arg Pro He Met Thr Phe Asn Lys Lys He Val 
20 25 ^ * 30 



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY: Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note* "motif A peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

Pro Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp Ser 
15 10 * 15 

Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note= "motif B peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:151: 

Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly He Pro Gin Gly Leu Cys 
15 10 15 

Val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala 
20 25 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 22 

(D) OTHER INFORMATION: /note= "motif C peptide from 

Euplotes aediculatus pl23" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

Pro Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu lie 
1 5 10 15 

Thr Thr Gin Glu Asn Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /note= "motif D peptide from 

Euplotes aediculatus pl23" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

Asn Val Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu 
1 5 * * 10 15 

(2) INFORMATION FOR SEQ ID NO: 154: 



W (i) SEQUENCE CHARACTERISTICS: 

H (A) LENGTH: 22 amino acids 

j*$ (B) TYPE: amino acid 

f«». (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "motif 1 peptide from 

Schizosaccharomyces pombe tezl" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

Phe lie lie Pro lie Leu Gin Ser Phe Phe Tyr lie Thr Glu Ser Ser 
15 10 15 

Asp Leu Arg Asn Arg Thr 
20 



<2) INFORMATION FOR SEQ ID NO: 155: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .30 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Schizosaccharomyces pombe tezl" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

Gin Lys Thr Thr Leu Pro Pro Ala Val lie Arg Leu Leu Pro Lys Lys 
1 5 10 15 

Asn Thr Phe Arg Leu lie Thr Asn Leu Arg Lys Arg Phe Leu 
20 25 ~ 30 

p (2) INFORMATION FOR SEQ ID NO: 156: 

□ (i) SEQUENCE CHARACTERISTICS: 

U! (A) LENGTH: 27 amino acids 

JE <B) TYPE: amino acid 

(C) STRANDEDNESS: 
<D) TOPOLOGY: linear 



en 



O 
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(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /notes "motif A peptide from 

Schizosaccharomyces pombe tezl" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

Arg Lys Lys Tyr Phe Val Arg He Asp He Lys Ser Cys Tyr Asp Arg 
15 10 15 

He Lys Gin Asp Leu Met Phe Arg He Val Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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( ix) FEATURE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note= "motif B peptide from 

Schizosaccharomyces pombe tezl" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:157: 

Gly Asn Ser Gin Tyr Leu Gin Lys Val Gly He Pro Gin Gly Ser He 
1 5 10 15 

Leu Ser Ser Phe Leu Cys His Phe Tyr Met Glu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 15 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



□ (ix) FEATURE: . 

m (A) NAME/KEY: Peptide 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "motif C peptide from 
01 Schizosaccharomyces pombe tezl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

Lys Lys Gly Ser Val Leu Leu Arg Val Val Asp Asp Phe Leu Phe He 
15 10 15 

Thr Val Asn Lys Lys Asp 
20 



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /note= "motif D peptide from 

Schizosaccharomyces pombe tezl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 
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Leu Asn Leu Ser Leu Arg Gly Phe Glu Lys His Asn Phe Ser Thr 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "motif 1 peptide from 

Saccharomyces cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:160: 

Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr Cys Thr Glu lie Ser 
15 10 15 

Ser Thr Val Thr He Val 
20 



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .32 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Saccharomyces cerevisiae EST2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

Thr Leu Ser Asn Phe Asn His Ser Lys Met Arg He He Pro Lys Lys 
1 5 10 15 

Ser Asn Asn Glu Phe Arg He He Ala He Pro Cys Arg Gly Ala Asp 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Saccharomyces cerevisiae EST2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Pro Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser 
1 5 10 15 

lie Pro Arg Met Glu Cys Met Arg He Leu Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
U (C) STRANDEDNESS: 

LTI (D) TOPOLOGY: linear 



r .: 



81 



(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note= "motif B peptide from 

Saccharomyces cerevisiae EST2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

Glu Asp Lys Cys Tyr He Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser 
1 5 10 15 

Leu Ser Ala Pro He Val Asp Leu Val Tyr Asp 
20 25 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 
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(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "motif C peptide from 

Saccharomyces cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

Ser Gin Asp Thr Leu lie Leu Lys Leu Ala Asp Asp Phe Leu lie lie 
15 10 ~ 15 

Ser Thr Asp Gin Gin Gin 
20 



(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .15 

(D) OTHER INFORMATION: /note= "motif D peptide from 

Saccharomyces cerevisiae EST2 11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

Lys Lys Leu Ala Met Gly Gly Phe Gin Lys Tyr Asn Ala Lys Ala 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 166 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS: 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .22 

(D) OTHER INFORMATION: /note= "motif 1 peptide from human 

telomerase core protein 1 (TCP1) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr 
15 10 15 

Phe Gin Lys Asn Arg Leu 
20 
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(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..30 

{D) OTHER INFORMATION: /note= "motif 0 peptide from human 

telomerase core protein 1 (TCP1) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro Lys Pro 
15 10 15 

Asp Gly Leu Arg Pro lie Val Asn Met Asp Tyr Val Val Gly 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 168 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note** "motif A peptide from human 

telomerase core protein 1 (TCP1) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr 
15 10 15 

He Pro Gin Asp Arg Leu Thr Glu Val He Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Protein 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note= "motif B peptide from human 

telomerase core protein 1 (TCP1) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

Arg Ala Thr Ser Tyr Val Gin Cys Gin Gly lie Pro Gin Gly Ser lie 
15 10 15 

Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .22 

(D) OTHER INFORMATION: /note= "motif C peptide from human 

telomerase core protein 1 (TCP1) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val 
15 10 15 

Thr Pro His Leu Thr His 
20 



(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /note= "motif D peptide from human 

telomerase core protein 1 (TCP1) " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

Phe Phe Tyr Val Thr Glu 
1 5 



L * (2) INFORMATION FOR SEQ ID NO: 173: 

put 

P (i) SEQUENCE CHARACTERISTICS: 

p (A) LENGTH: 4029 base pairs 

y*] (B) TYPE: nucleic acid 

f£ (C) STRANDEDNESS: single 

S (D) TOPOLOGY: linear 



u 



(ii) MOLECULE TYPE: cDNA 



^ (ix) FEATURE: 

W (A) NAME /KEY: 

H (B) LOCATION : 1. .4029 

\& (D) OTHER INFORMATION: /notes "preliminary sequence for 

human TRT cDNA insert of 
plasmid pGRN121" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC CCGCGATGCC 60 

GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC AGCCACTACC GCGAGGTGCT 120 

GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG 180 

GGACCCGGCG GCTTTCCGCG CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN 240 

ANGGCNGCCC CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 300 

CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT TCGGCTTCGC 360 

GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC ACCACCAGCG TGCGCAGCTA 420 

CCTGCCCAAC ACGGTGACCG ACGCACTGCG GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG 480 

CCGCGTGGGC GACGACGTGC TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT 540 
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GGNTCCCAGC TGCGCCTACC ANGTGTGCGG 
TCAGGCCCGG CCCCCGCCAC ACGCTANTGG 
GGAACCATAG CGTCAGGGAG GCCGGGGTCC 
GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
CCCTGAGCCG GAGCGGACGC CCGTTGGGCA 
TGGACCGAGT GACCGTGGTT TCTGTGTGGT 
CTCTTTGGAG GGTGCGCTCT CTGGCACGCG 
CCACGCGGGC CCCCCATCCA CATCGCGGCC 
GTGTACGCCG AGACCAAGCA CTTCCTCTAC 
CTTCCTACTC AATATATCTG AGGCCCAGCC 
NTCTTTCTGG TTCCAGGCCT TGGATGCCAG 
GCGNTACTGG CAAATGCGGC CCCTGTTTCT 
I := CTACGGGGTG TTCCTCAAGA CGCACTGCCC 
P TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 

O 

y ; | ACCCCCGTCG CCTGGTGCAG CTGCTCCGCC 

^ TCGTGCGGGC CTGCCTGCGC CGGCTGGTGC 

U AACGCCGCTT CCTCAGGAAC ACCAAGAAGT 

f CGCTGCAGGA GCTGACGTGG AAGATGAGCG 

M CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
TCCTGCACTG GCTGATGAGT GTGTACGTCG 

W CGGAGACCAC GTTTCAAAAG AACAGGCTCT 

Q 

f]j TGCAAAGCAT TGGAATCAGA CAGCACTTGA 
CAGAGGTCAG GCAGCATCGG GAAGCCAGGC 
TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
CGTTCCGCAG AGAAAAGAGG GCCGAGCGTC 
TGCTCAACTA CGAGCGGGCG CGGCGCCCCG 
ACGATATCCA CAGGGCCTGG CGCACCTTCG 
CTGAGCTGTA CTTTGTCAAG GTGGATGTGA 
GGCTCACGGA GGTCATCGCC AGCATCATCA 
ATGCCGTGGT CCAGAAGGCC GCCCATGGGC 
CTACCTTGAC AGACCTCCAG CCGTACATGC 



GCCGCCGCTG 


TACCAGCTCG 


GCGCTGCNAC 


600 


ACCCGAANGC 


GTCTGGGATC 


CAACGGGCCT 


660 


CCCTGGGCTG 


CCAGCCCCGG 


GTGCGAGGAG 


720 


GTTGCCCAAG 


AGGCCCAGGC 


GTGGCGCTGC 


780 


GGGGTCCTGG 


GCCCACCCGG 


GCAGGACGCC 


840 


GTCACCTGCC 


AGACCCGCCG 


AAGAAGCCAC 


900 


CCACTCCCAC 


CCATCCGTGG 


GCCGCCAGCA 


960 


ACCACGTCCT 


GGGACACGCC 


TTGTCCCCCG 


1020 


TCCTCAGGCG 


ACAAGNACAC 


TGCGNCCCTC 


1080 


TGACTGGCGT 


TCGGGAGGTT 


CGTGGAGACA 


1140 


GATTCCCCGC 


AGGTTGCCCC 


GCCTGCCCCA 


1200 


GGAGCTGCTT 


GGGAACCACG 


CGCAGTGCCC 


1260 


GCTGCGAGCT 


GCGGTCACCC 


CAGCAGCCGG 


1320 


TGTGGCGGCC 


CCCGAGGAGG 


AGGAACACAG 


1380 


AGCACAGCAG 


CCCCTGGCAG 


GTGTACGGCT 


1440 


CCCCAGGCCT 


CTGGGGCTCC 


AGGCACAACG 


1500 


TCATCTCCCT 


GGGGAAGCAT 


GCCAAGCTCT 


1560 


TGCGGGACTG 


CGCTTGGCTG 


CGCAGGAGCC 


1620 


ACCGTCTGCG 


TGAGGAGATC 


CTGGCCAAGT 


1680 


TCGAGCTGCT 


CAGGTCTTTC 


TTTTATGTCA 


1740 


TTTTCTACCG 


GAAGAGTGTC 


TGGAGCAAGT 


1800 


AGAGGGTGCA 


GCTGCGGGAG 


CTGTCGGAAG 


1860 


CCGCCCTGCT 


GACGTCCAGA 


CTCCGCTTCA 


1920 


TGAACATGGA 


CTACGTCGTG 


GGAGCCAGAA 


1980 


TCACCTCGAG 


GGTGAAGGCA 


CTGTTCAGCG 


2040 


GCCTCCTGGG 


CGCCTCTGTG 


CTGGGCCTGG 


2100 


TGCTGCGTGT 


GCGGGCCCAG 


GACCCGCCGC 


2160 


CGGGCGCGTA 


CGACACCATC 


CCCCAGGACA 


2220 


AACCCCAGAA 


CACGTACTGC 


GTGCGTCGGT 


2280 


ACGTCCGCAA 


GGCCTTCAAG 


AGCCACGTCT 


2340 


GACAGTTCGT 


GGCTCACCTG 


CAGGANAACA 


2400 
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L 





cjpp Pf2 rTr 21 r* 


ovjtA X AiU v_ vj X C 


GTCATCGAGC 


tv TV /~i nmnom/i 

AGAGCTCCTC 


CCTGAATGAG 


GCCAGCAGTG 


2460 




wv#v* x \~ x x VtfUn 


P fiTV TT P r ** i ' 7A 


f /"'f M'T'f TV '"P/**""^ 

Vubl ILAlbi 


GCCACCACGC 


CGTGCGCATC 


AGGGGCAAGT 


2520 




PPT7XPflTPP£ 


r ,r vnr , r*Ts.r , nr , n 
vjr X oCCAvjVjVjo 


AJ.CCCGCAGG 


GCTCCATCCT 


CTCCACGCTG 


> till Tfc M M jM 

CTCTGCAGCC 


2580 








a TAr'Tv Tvr , r , TV" ,r P 
AAC AAbrC Ibi 


TTGCGGGGAT 


TCGGCGGGAC 


GGGCTGCTCC 


2640 




L\J\~.\J X X X V30 1 


/"""i TV 7V 1 1 HI ill 1 /^i 

vjvjAXvjAX X 1L, 


TTGTTGGTGA 


CACCTCACCT 


CACCCACGCG 


AAAACCTTCC 


2700 




X v^>i\jvjAv_.v».L. X 


Gbr X UV-vjAGG X 


GTCCCTGAGT 


ATGGCTGCGT 


GGTGAACTTG 


CGGAAGACAG 


2760 




X Vjro X unnL X X 


LLC 1 0 X AvaAA 


GACGAGGCCC 


TGGGTGGCAC 


GGCTTTTGTT 


CAGATGCCGG 


2820 






AX XUL.UCIGvj 


X Gv-GGCLTGC 


TGCTGGATAC 


CCGGACCCTG 


GAGGTGCAGA 


2880 




/^IPftl7A fTfi /'ii 11^1 
oWVai-rA* XAt X \- 


CAvjC X AX VjL-C 


f f f 7V /T^T^^f tv 
CGGACCTCGA 


TCAGAGCCAG 


TCTCACCTTC 


AACCGCGGCT 


2940 




TP7A 7^C*CiCTCXCZ 

X V^rtrtUUL X17VJT 


n Af3T!7x 7A P7XTP 
VjAvjvjAAvJAX 


PPTPHPTa tv TV /"* 

cvj X V-vjUAAAC 


TCTTTGGGGT 


CTTGCGGCTG 


AAGTGTCACA 


3000 




uLLllji X 1L1 


GGA x x TGCAG 


GTG AACAGC C 


TCCAGACGGT 


GTGCACCAAC 


ATCTACAAGA 


3060 






GCAGGCGTAC 


AGGTTTCACG 


CATGTGTGCT GCAGCTCCCA 


TTTCATCAGC 


3120 




AAG I TTGGAA 


GAACCCCACA 


TTTTTCCTGC 


GCGTCATCTC 


TGACACGGCC 


TCCCTCTGCT 


3180 


O 


TV OrTT/**1/"*t TV ITl/*i^rTJ 

ACTCCATCCT 


GAAAGCCAAG 


AACGCAGGGA 


TGTCGCTGGG 


GGCCAAGGGC 


GCCGCCGGCC 


3240 




CTCTGCCCTC 


CGAGGCCGTG 


CAGTGGCTGT 


GCCACCAAGC 


ATTCCTGCTC 


AAGCTGACTC 


3300 


Bi 


GACAC CGTGx 


CACCTACGTG 


CCACTCCTGG 


GGTCACTCAG 


GACAGCCCAG 


ACGCAGCTGA 


3360 


U 


GTCGGAAGCx 


CCCGGGGACG 


ACGCTGACTG 


CCCTGGAGGC 


CGCAGCCAAC 


CCGGCACTGC 


3420 


f»ft 


CCTCAGACTT 


CAAGAC CATC 


CTGGACTGAT 


GGCCACCCGC 


CCACAGCCAG 


GCCGAGAGCA 


3480 


D 


GACAC CAGCA 


GCCCTGTCAC 


GCCGGGCTCT 


ACGTCCCAGG 


GAGGGAGGGG 


CGGCCCACAC 


3540 


i 'i 




ACCGC x GGGA 


GTCTGAGGCC 


TGAGTGAGTG 


TTTGGCCGAG 


GCCTGCATGT 


3600 


0 


prv n pwi tv ta 


GCTGAGTGTC 


CGGCTGAGGC 


CTGAGCGAGT 


GTCCAGCCAA 


GGGCTGAGTG 


3660 


fy 


X UUAGCACAC. 


CTGCCGTCTT 


CACTTCCCCA 


CAGGCTGGCG 


CTCGGCTCCA 


CCCCAGGGCC 


3720 




AGCTTTTCCT 


CACCAGGAGC 


CCGGCTTCCA 


CTCCCCACAT 


AGGAATAGTC 


CATCCCCAGA 


3780 




TTCGCCATTG 


TTCACCCCTC 


GCCCTGCCCT 


CCTTTGCCTT 


CCACCCCCAC 


CATCCAGGTG 


3840 




GAGACCCTGA 


GAAGGACCCT 


GGGAGCTCTG 


GGAATTTGGA 


GTGACCAAAG 


GTGTGCCCTG 


3900 




TACACAGGCG 


AGGACCCTGC 


ACCTGGATGG 


GGGTCCCTGT 


GGGTCAAATT 


GGGGGGAGGT 


3960 




GCTGTGGGAG 


TAAAATACTG 


AATATATGAG 


TTTTTCAGTT TTGAAAAAAA AAAAAAAAAA 


4020 



AAAAAAAAA 4029 

(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

Ala Ala Leu Arg Pro Ala Ala His Val Gly Ser Pro Gly Pro Gly His 
1 5 10 15 

Pro Arg Asp Ala Ala Arg Ser Pro Leu Pro Ser Arg Ala Leu Pro Ala 
20 25 30 

Ala Gin Pro Leu Pro Arg Gly Ala Ala Ala Gly His Val Arg Ala Ala 
35 40 45 

Pro Gly Ala Pro Gly Leu Ala Ala Gly Ala Ala Arg Gly Pro Gly Gly 
50 55 60 

Phe Pro Arg Xaa Gly Gly Pro Xaa Xaa Gly Val Arg Ala Leu Gly Xaa 
65 70 75 80 

Xaa Ala Ala Pro Arg Arg Pro Leu Leu Pro Pro Gly Val Leu Pro Glu 
85 90 95 

Xaa Xaa Gly Gly Pro Ser Ala Ala Xaa Ala Val Arg Xaa Arg Arg Glu 
100 105 ~ no 

Xaa Arg Ala Gly Leu Arg Leu Arg Ala Ala Gly Arg Gly Pro Arg Gly 
115 120 125 

Pro Pro Arg Gly Leu His His Gin Arg Ala Gin Leu Pro Ala Gin His 
130 135 140 

Gly Asp Arg Arg Thr Ala Gly Glu Arg Gly Val Gly Ala Ala Ala Ala 
145 150 155 ~ 160 

Pro Arg Gly Arg Arg Arg Ala Gly Ser Pro Ala Gly Thr Leu Arg Xaa 
165 170 175 

Xaa Cys Ala Gly Gly Ser Gin Leu Arg Leu Pro Xaa Val Arg Ala Ala 
180 185 190 

Ala Val Pro Ala Arg Arg Cys Xaa Ser Gly Pro Ala Pro Ala Thr Arg 
195 200 205 

Xaa Trp Thr Arg Xaa Arg Leu Gly Ser Asn Gly Pro Gly Thr He Ala 
210 215 220 

Ser Gly Arg Pro Gly Ser Pro Trp Ala Ala Ser Pro Gly Cys Glu Glu 
225 230 235 - 240 

Ala Arg Gly Gin Cys Gin Pro Lys Ser Ala Val Ala Gin Glu Ala Gin 
245 250 255 

Ala Trp Arg Cys Pro 
260 



INFORMATION FOR SEQ ID NO: 175: 



107 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:175: 

Ala Gly Ala Asp Ala Arg Trp Ala Gly Val Leu Gly Pro Pro Gly Gin 
1 5 10 15 

Asp Ala Trp Thr Glu 
20 



INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

Pro Trp Phe Leu Cys Gly Val Thr Cys Gin Thr Arg Arg Arg Ser His 
1 5 10 15 

Leu Phe Gly Gly Cys Ala Leu Trp His Ala Pro Leu Pro Pro lie Arg 
20 25 30 

Gly Pro Pro Ala Pro Arg Gly Pro Pro He His He Ala Ala Thr Thr 
35 40 45 

Ser Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe 
50 55 60 

Leu Tyr Ser Ser Gly Asp Lys Xaa Thr Ala Xaa Leu Leu Pro Thr Gin 
65 70 75 80 

Tyr He 



INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 
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Leu Ala Phe Gly Arg Phe Val Glu Thr Xaa Phe Leu Val Pro Gly Leu 
1 5 10 15 

Gly Cys Gin Asp Ser Pro Gin Val Ala Pro Pro Ala Pro Ala Xaa Leu 
20 25 30 

Ala Asn Ala Ala Pro Val Ser Gly Ala Ala Trp Glu Pro Arg Ala Val 
35 40 " 45 

Pro Leu Arg Gly Val Pro Gin Asp Ala Leu Pro Ala Ala Ser Cys Gly 
50 55 60 

His Pro Ser Ser Arg Cys Leu Cys Pro Gly Glu Ala Pro Gly Leu Cys 
« 70 " 75 80 

Gly Gly Pro Arg Gly Gly Gly Thr Gin Thr Pro Val Ala Trp Cys Ser 
85 90 95 

Cys Ser Ala Ser Thr Ala Ala Pro Gly Arg Cys Thr Ala Ser Cys Gly 
100 105 *" * 110 

Pro Ala Cys Ala Gly Trp Cys Pro Gin Ala Ser Gly Ala Pro Gly Thr 
115 120 125 

Thr Asn Ala Ala Ser Ser Gly Thr Pro Arg Ser Ser Ser Pro Trp Glv 
130 135 140 

Ser Met Pro Ser Ser Arg Cys Arg Ser 
145 150 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

Ala Cys Gly Thr Ala Leu Gly Cys Ala Gly Ala Gin Gly Leu Ala Val 
1 5 io 15 

Phe Arg Pro Gin Ser Thr Val Cys Val Arg Arg Ser Trp Pro Ser Ser 
20 25 30 

Cys Thr Gly 
35 

(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179 : 

Val Cys Thr Ser Ser Ser Cys Ser Gly Leu Ser Phe Met Ser Arg Arg 
15 10 15 

Pro Arg Phe Lys Arg Thr Gly Ser Phe Ser Thr Gly Arg Val Ser Gly 
20 25 30 

Ala Ser Cys Lys Ala Leu Glu Ser Asp Ser Thr 
35 40 

(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

U1 Arg Gly Cys Ser Cys Gly Ser Cys Arg Lys Gin Arg Ser Gly Ser He 

£ 1 5 10 15 



Gly Lys Pro Gly Pro Pro Cys 
20 



(2) INFORMATION FOR SEQ ID NO: 181: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

Arg Pro Asp Ser Ala Ser Ser Pro Ser Leu Thr Gly Cys Gly Arg Leu 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:182: 

Thr Trp Thr Thr Ser Trp Glu Pro Glu Arg Ser Ala Glu Lys Arg Gly 
1 5 10 15 

Pro Ser Val Ser Pro Arg Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

Arg His Cys Ser Ala Cys Ser Thr Thr Ser Gly Arg Gly Ala Pro Ala 
1 5 10 ^ 15 



Ser Trp Ala Pro Leu Cys Trp Ala Trp Thr He Ser Thr Gly Pro Gly 
20 25 30 

P 

Lf! Ala Pro Ser Cys Cys Val Cys Gly Pro Arg Thr Arg Arg Leu Ser Cys 

35 40 45 

Thr Leu Ser Arg Trp Met 

H 50 



(2) INFORMATION FOR SEQ ID NO: 184: 



H (i) SEQUENCE CHARACTERISTICS: 

U (A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:184: 

Arg Ala Arg Thr Thr Pro Ser Pro Arg Thr Gly Ser Arg Arg Ser Ser 
1 5 10 ~ 15 

Pro Ala Ser Ser Asn Pro Arg Thr Arg Thr Ala Cys Val Gly Met Pro 
20 25 30 

Trp Ser Arg Arg Pro Pro Met Gly Thr Ser Ala Arg Pro Ser Arg Ala 
35 40 45 

Thr Ser Leu Pro 
50 

(2) INFORMATION FOR SEQ ID NO: 185: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

Gin Thr Ser Ser Arg Thr Cys Asp Ser Ser Trp Leu Thr Cys Arg Xaa 
1 5 io 15 

Thr Ala Arg 



INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

Gly Met Pro Ser Ser Ser Ser Arg Ala Pro Pro 
1 5 io 



INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:187: 

Met Axg Pro Ala Val Ala Ser Ser Thr Ser Ser Tyr Ala Ser Cys Ala 
1 5 io " is 

Thr Thr Pro Cys Ala Ser Gly Ala Ser Pro Thr Ser Ser Ala Arg Gly 
20 25 30 

Ser Arg Arg Ala Pro Ser Ser Pro Arg Cys Ser Ala Ala Cys Ala Thr 
35 40 45 

Ala Thr Trp Arg Thr Ser Cys Leu Arg Gly Phe Gly Gly Thr Gly Cys 
50 55 60 

Ser Cys Val Trp Trp Met lie Ser Cys Trp 
65 70 
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INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:188: 

His Leu Thr Ser Pro Thr Arg Lys Pro Ser Ser Gly Pro Trp Ser Glu 
1 5 10 15 

Val Ser Leu Ser Met Ala Ala Trp 
20 



INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

Thr Cys Gly Arg Gin Trp 
1 5 



INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

Lys Thr Arg Pro Trp Val Ala Arg Leu Leu Phe Arg Cys Arg Pro Thr 
1 5 10 15 

Ala Tyr Ser Pro Gly Ala Ala Cys Cys Trp He Pro Gly Pro Trp Arg 
20 25 3 0 

Cys Arg Ala Thr Thr Pro Ala Met Pro Gly Pro Pro Ser Glu Pro Val 
35 40 45 

Ser Pro Ser Thr Ala Ala Ser Arg Leu Gly Gly Thr Cys Val Ala Asn 
50 55 60 
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Ser Leu Gly Ser Cys Gly 
65 70 



kit 



03 



(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

Ser Val Thr Ala Cys Phe Trp lie Cys Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

Thr Ala Ser Arg Arg Cys Ala Pro Thr Ser Thr Arg Ser Ser Cys Cys 
1 5 10 " 15 



Arg Arg Thr Gly Phe Thr His Val Cys Cys Ser Ser His Phe He Ser 

U 20 25 30 

Lys Phe Gly Arg Thr Pro His Phe Ser Cys Ala Ser Ser Leu Thr Arg 
35 40 45 



Pro Pro Ser Ala Thr Pro Ser 
50 55 



(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 



Lys Pro Arg Thr Gin Gly Cys Arg Trp Gly Pro Arg Ala Pro Pro Ala 
15 10 15 
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Leu Cys Pro Pro Arg Pro Cys Ser Gly Cys Ala Thr Lys His Ser Cys 
20 25 30 



B 



Ser Ser 

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

Leu Asp Thr Val Ser Pro Thr Cys His Ser Trp Gly His Ser Gly Gin 
1 5 10 * 15 

Pro Arg Arg Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 195: 



Ul (i) SEQUENCE CHARACTERISTICS: 

i: p (A) LENGTH: 8 amino acids 

m (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



f* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

pi 

Val Gly Ser Ser Arg Gly Arg Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 196 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

Leu Pro Trp Arg Pro Gin Pro Thr Arg His Cys Pro Gin Thr Ser Arg 
15 10 15 

Pro Ser Trp Thr Asp Gly His Pro Pro Thr Ala Arg Pro Arg Ala Asp 
20 25 30 
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Thr Ser Ser Pro Val Thr Pro Gly Ser Thr Ser Gin Gly Gly Arg Gly 
35 40 45 

Gly Pro His Pro Gly Pro His Arg Trp Glu Ser Glu Ala 

50 55 60 



INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 

Val Ser Val Trp Pro Arg Pro Ala Cys Pro Ala Glu Gly 
15 10 



INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

Gly Leu Ser Glu Cys Pro Ala Lys Gly 
1 5 



INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

Val Ser Ser Thr Pro Ala Val Phe Thr Ser Pro Gin Ala Gly Ala Arg 
15 10 15 

Leu His Pro Arg Ala Ser Phe Ser Ser Pro Gly Ala Arg Leu Pro Leu 
20 25 30 

Pro Thr 
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(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:200: 

Ser He Pro Arg Phe Ala He Val His Pro Ser Pro Cys Pro Pro Leu 
15 10 15 

Pro Ser Thr Pro Thr He Gin Val Glu Thr Leu Arg Arg Thr Leu Gly 
20 25 30 

Ala Leu Gly He Trp Ser Asp Gin Arg Cys Ala Leu Tyr Thr Gly Glu 
35 40 ~ 45 

Asp Pro Ala Pro Gly Trp Gly Ser Leu Trp Val Lys Leu Gly Gly Gly 
50 55 ~ 60 

Ala Val Gly Val Lys Tyr 
65 70 

(2) INFORMATION FOR SEQ ID NO: 201: 



(i) SEQUENCE CHARACTERISTICS: 
H (A) LENGTH: 16 amino acids 

jn* (B) TYPE: amino acid 

(C) STRANDEDNESS: 
™ (D) TOPOLOGY: linear 

hid 

H* (ii) MOLECULE TYPE: peptide 

m 

2 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:201: 

ill He Tyr Glu Phe Phe Ser Phe Glu Lys Lys Lys Lys Lys Lys Lys Lys 

1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:202: 

Gin Arg Cys Val Leu Leu Arg Thr Trp Glu Ala Leu Ala Pro Ala Thr 
15 10 15 
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Pro Ala Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu 
20 25 30 

Arg Ser His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg 
35 40 45 

Leu Gly Pro Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala 
50 55 60 

Phe Arg Ala Xaa Val Ala Xaa Cys Xaa Val Cys Val Pro Trp Xaa Xaa 
65 70 75 80 

Xaa Xaa Pro Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Xaa 
85 90 95 

Xaa Leu Val Ala Arg Val Leu Xaa Xaa Leu Cys Xaa Arg Gly Ala Xaa 
100 105 no 

Asn Val Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Glv 
115 120 125 

Pro Pro Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr 
130 135 140 

Val Thr Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg 
145 150 155 160 

Arg Val Gly Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Xaa 
165 170 175 

Phe Val Leu Val Xaa Pro Ser Cys Ala Tyr Xaa Val Cys Gly Pro Pro 
180 185 190 

Leu Tyr Gin Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala 
195 200 ^ 205 

Xaa Gly Pro Glu Xaa Val Trp Asp Pro Thr Gly Leu Glu Pro 
210 215 220 

INFORMATION FOR SEQ ID NO: 2 03: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 03: 

Arg Gin Gly Gly Arg Gly Pro Pro Gly Leu Pro Ala Pro Gly Ala Arg 
15 10 15 

Arg Arg Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro 
20 25 30 

Arg Arg Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly 
35 40 45 
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Ser Trp Ala His Pro Gly Arg Thr Pro Gly Pro Ser Asp Arg Gly Phe 
50 55 60 

Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu 
55 70 75 80 

Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin 
85 90 95 

His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Gly Thr 
100 105 no 

Arg Leu Val Pro Arg Cys Thr Pro Arg Pro Ser Thr Ser Ser Thr Pro 
115 120 125 

Gin Ala Thr Xaa Thr Leu Arg Pro Ser Phe Leu Leu Asn He Ser Glu 
130 135 140 

Ala Gin Pro Asp Trp Arg Ser Gly Gly Ser Trp Arg Xaa Ser Phe Trp 
145 150 155 160 

Phe Gin Ala Leu Asp Ala Arg He Pro Arg Arg Leu Pro Arg Leu Pro 
165 170 175 

Gin Arg Tyr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn 
180 185 190 

His Ala Gin Cys Pro Tyr Gly Val Phe Leu Lys Thr His Cys Pro Leu 
195 200 * 205 

Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro 
210 215 220 

Gin Gly Ser Val Ala Ala Pro Glu Glu Glu Glu His Arg Pro Pro Ser 
225 230 235 ^ 240 

Pro Gly Ala Ala Ala Pro Pro Ala Gin Gin Pro Leu Ala Gly Val Arg 
245 250 255 

Leu Arg Ala Gly Leu Pro Ala Pro Ala Gly Ala Pro Arg Pro Leu Gly 
260 265 270 

Leu Gin Ala Gin Arg Thr Pro Leu Pro Gin Glu His Gin Glu Val His 
275 280 285 

Leu Pro Gly Glu Ala Cys Gin Ala Leu Ala Ala Gly Ala Asp Val Glu 
290 295 300 



Asp Glu Arg Ala Gly Leu Arg Leu Ala Ala Gin Glu Pro Arg Gly Trp 
305 310 315 320 

Leu Cys Ser Gly Arg Arg Ala Pro Ser Ala 
325 330 



INFORMATION FOR SEQ ID NO: 2 04: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:204: 

Gly Asp Pro Gly Gin Val Pro Ala Leu Ala Asp Glu Cys Val Arg Arg 
15 10 15 

Arg Ala Ala Gin Val Phe Leu Leu Cys His Gly Asp His Val Ser Lys 
20 25 " 30 

Glu Gin Ala Leu Phe Leu Pro Glu Glu Cys Leu Glu Gin Val Ala Lys 
35 40 " 45 

His Trp Asn Gin Thr Ala Leu Glu Glu Gly Ala Ala Ala Gly Ala Val 
50 55 60 

Gly Ser Arg Gly Gin Ala Ala Ser Gly Ser Gin Ala Arg Pro Ala Asp 
65 70 75 80 

Val Gin Thr Pro Leu His Pro Gin Ala 
85 

(2) INFORMATION FOR SEQ ID NO: 205: 

p (i) SEQUENCE CHARACTERISTICS: 

~~ (A) LENGTH: 76 amino acids 

^ (B) TYPE: amino acid 

U1 (C) STRANDEDNESS : 

J* (D) TOPOLOGY: linear 

ill 

J"; (ii) MOLECULE TYPE: peptide 



p 

0 

(1J 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

Arg Ala Ala Ala Asp Cys Glu His Gly Leu Arg Arg Gly Ser Gin Asn 
1 5 10 15 

Val Pro Gin Arg Lys Glu Gly Arg Ala Ser His Leu Glu Gly Glu Gly 
20 25 30 

Thr Val Gin Arg Ala Gin Leu Arg Ala Gly Ala Ala Pro Arg Pro Pro 
35 40 45 

Gly Arg Leu Cys Ala Gly Pro Gly Arg Tyr Pro Gin Gly Leu Ala His 
50 55 60 

Leu Arg Ala Ala Cys Ala Gly Pro Gly Pro Ala Ala 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 

Ala Val Leu Cys Gin Gly Gly Cys Asp Gly Arg Val Arg His His Pro 
15 10 15 

Pro Gly Gin Ala His Gly Gly His Arg Gin His His Gin Thr Pro Glu 
20 25 30 

His Val Leu Arg Ala Ser Val Cys Arg Gly Pro Glu Gly Arg Pro Trp 
35 40 ~ * 45 

Ala Arg Pro Gin Gly Leu Gin Glu Pro Arg Leu Tyr Leu Asp Arg Pro 
50 55 60 

Pro Ala Val His Ala Thr Val Arg Gly Ser Pro Ala Gly Xaa Gin Pro 
65 70 75 * 80 

Ala Glu Gly Cys Arg Arg His Arg Ala Glu Leu Leu Pro Glu 
85 90 

(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
H* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

e Gly Gin Gin Trp Pro Leu Arg Arg Leu Pro Thr Leu His Val Pro Pro 

■pn 15 10 15 

P R Arg Arg Ala His Gin Gly Gin Val Leu Arg Pro Val Pro Gly Asp Pro 

H* 20 25 30 



til 



Ala Gly Leu His Pro Leu His Ala Ala Leu Gin Pro Val Leu Arg Arg 
35 40 45 

His Gly Glu Gin Ala Val Cys Gly Asp Ser Ala Gly Arg Ala Ala Pro 
50 55 60 

Ala Phe Gly Gly 
65 



(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208: 
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Phe Leu Val Gly Asp Thr Ser Pro His Pro Arg Glu Asn Leu Pro Gin 
15 10 15 

Asp Pro Gly Pro Arg Cys Pro 
20 



(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

Glu Asp Ser Gly Glu Leu Pro Cys 
10 15 

Gly Phe Cys Ser Asp Ala Gly Pro 
25 30 

Ala Ala Gly Tyr Pro Asp Pro Gly 
45 

Cys Pro Asp Leu His Gin Ser Gin 
60 

Gly Trp Glu Glu His Ala Ser Gin 
75 80 

Val Ser Gin Pro Val Ser Gly Phe 
90 95 

Val His Gin His Leu Gin Asp Pro 
105 110 

Arg Met Cys Ala Ala Ala Pro lie 
125 

His lie Phe Pro Ala Arg His Leu 
140 



(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

His Gly Leu Pro Leu Leu Leu His Pro Glu Ser Gin Glu Arg Arg Asp 
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Val Trp Leu Arg Gly Glu Leu Ala 
1 5 

Arg Arg Arg Gly Pro Gly Trp His 
20 

Arg Pro lie Pro Leu Val Arg Pro 
35 40 

Gly Ala Glu Arg Leu Leu Gin Leu 
50 55 



Ser His Leu Gin Pro Arg Leu Gin 

^ 65 70 

L Thr Leu Trp Gly Leu Ala Ala Glu 

M 85 

]** 

I?* Ala Gly Glu Gin Pro Pro Asp Gly 

m. 100 



W Pro Ala Ala Gly Val Gin Val Ser 

fU 115 120 

Ser Ser Ala Ser Leu Glu Glu Pro 

130 135 



(2) INFORMATION FOR SEQ ID NO: 2 10: 
(i) SEQUENCE CHARACTERISTICS: 



V < ft 



15 10 15 

Val Ala Gly Gly Gin Gly Arg Arg Arg Pro Ser Ala Leu Arg Gly Arg 
20 25 30 

Ala Val Ala Val Pro Pro Ser He Pro Ala Gin Ala Asp Ser Thr Pro 
35 40 45 

Cys His Leu Arg Ala Thr Pro Gly Val Thr Gin Asp Ser Pro Asp Ala 
50 55 60 

Ala Glu Ser Glu Ala Pro Gly Asp Asp Ala Asp Cys Pro Gly Gly Arg 
65 70 " 75 80 

Ser Gin Pro Gly Thr Ala Leu Arg Leu Gin Asp His Pro Gly Leu Met 
85 90 ^ 95 

Ala Thr Arg Pro Gin Pro Gly Arg Glu Gin Thr Pro Ala Ala Leu Ser 
100 105 110 

Arg Arg Ala Leu Arg Pro Arg Glu Gly Gly Ala Ala His Thr Gin Ala 
115 120 125 

Arg Thr Ala Gly Ser Leu Arg Pro Glu 
130 135 

(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

Val Phe Gly Arg Gly Leu His Val Arg Leu Lys Ala Glu Cys Pro Ala 
Ml 5 10 15 

W 

Glu Ala 

(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:212: 

Ala Ser Val Gin Pro Arg Ala Glu Cys Pro Ala His Leu Pro Ser Ser 
15 10 15 



PI 
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Leu Pro His Arg Leu Ala Leu Gly Ser Thr Pro Gly Pro Ala Phe Pro 
20 25 " 30 

His Gin <31u Pro Gly Phe His Ser Pro His Arg Asn Ser Pro Ser Pro 
35 40 ~ 45 

Asp Ser Pro Leu Phe Thr Pro Arg Pro Ala Leu Leu Cys Leu Pro Pro 
50 55 60 

Pro Pro Ser Arg Trp Arg Pro 
65 70 



INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 

Glu Gly Pro Trp Glu Leu Trp Glu Phe Gly Val Thr Lys Gly Val Pro 
1 5 10 ~ 15 

Cys Thr Gin Ala Arg Thr Leu His Leu Asp Gly Gly Pro Cys Gly Ser 
20 25 4 30 

Asn Trp Gly Glu Val Leu Trp Glu 
35 40 



INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

Asn Thr Glu Tyr Met Ser Phe Ser Val Leu Lys Lys Lys Lys Lys Lys 
1 5 10 A 15 

Lys Lys 



INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

Ser Ala Ala Ser Cys Cys Ala Arg Gly Lys Pro Trp Pro Arg Pro Pro 
1 5 10 15 

Pro Arg Cys Arg Ala Leu Pro Ala Ala Glu Pro Cys Ala Pro Cys Cys 
20 25 30 

Ala Ala Thr Thr Ala Arg Cys Cys Arg Trp Pro Arg Ser Cys Gly Ala 
35 40 45 

Trp Gly Pro Arg Ala Gly Gly Trp Cys Ser Ala Gly Thr Arg Arg Leu 
50 55 60 

Ser Ala Arg Trp Trp Pro Xaa Ala Trp Cys Ala Cys Pro Gly Xaa Xaa 
65 70 75 80 

Gly Xaa Pro Pro Pro Pro Pro Pro Ser Ala Arg Cys Pro Ala 
85 90 



INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

Xaa Xaa Trp Trp Pro Glu Cys Cys Xaa Xaa Cys Ala Xaa Ala Ala Arg 
15 10 15 

Xaa Thr Cys Trp Pro Ser Ala Ser Arg Cys Trp Thr Gly Pro Ala Gly 
20 25 ' 30 

Ala Pro Pro Arg Pro Ser Pro Pro Ala Cys Ala Ala Thr Cys Pro Thr 
35 40 45 

Arg 

INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:217: 
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Pro Thr His Cys Gly Gly Ala Gly Arg Gly Gly Cys Cys Cys Ala Ala 
1 5 10 15 

Trp Ala Thr Thr Cys Trp Phe Thr Cys Trp His Ala Ala Arg Xaa Leu 
20 25 30 

Cys Trp Trp Xaa Pro Ala Ala Pro Thr Xaa Cys Ala Gly Arg Arg Cys 
35 40 45 

Thr Ser Ser Ala Leu Xaa Leu Arg Pro Gly Pro Arg His Thr Leu Xaa 
50 55 60 

Asp Pro Xaa Ala Ser Gly He Gin Arg Ala Trp Asn His Ser Val Arg 
65 70 75 80 

Glu Ala Gly Val Pro Leu Gly Cys Gin Pro Arg Val Arg Gly Gly Ala 
85 90 " 95 

Gly Ala Val Pro Ala Glu Val Cys Arg Cys Pro Arg Gly Pro Gly Val 
100 105 "* no 

Ala Leu Pro Leu Ser Arg Ser Gly Arg Pro Leu Gly Arg Gly Pro Gly 
115 120 125 

Pro Thr Arg Ala Gly Arg Leu Asp Arg Val Thr Val Val Ser Val Trp 
130 135 140 

Cys His Leu Pro Asp Pro Pro Lys Lys Pro Pro Leu Trp Arg Val Arg 
145 150 155 ~ ~ 160 

Ser Leu Ala Arg Ala Thr Pro Thr His Pro Trp Ala Ala Ser Thr Thr 
165 170 175 

Arg Ala Pro His Pro His Arg Gly His His Val Leu Gly His Ala Leu 
180 185 190 

Ser Pro Gly Val Arg Arg Asp Gin Ala Leu Pro Leu Leu Leu Arg Arg 
195 200 205 

Gin Xaa His Cys Xaa Pro Pro Ser Tyr Ser He Tyr Leu Arg Pro Ser 
210 215 220 

Leu Thr Gly Val Arg Glu Val Arg Gly Asp Xaa Leu Ser Gly Ser Arg 
225 230 235 240 

Pro Trp Met Pro Gly Phe Pro Ala Gly Cys Pro Ala Cys Pro Ser Xaa 
245 250 * 255 

Thr Gly Lys Cys Gly Pro Cys Phe Trp Ser Cys Leu Gly Thr Thr Arg 
260 265 270 

Ser Ala Pro Thr Gly Cys Ser Ser Arg Arg Thr Ala Arg Cys Glu Leu 
275 280 285 

Arg Ser Pro Gin Gin Pro Val Ser Val Pro Gly Arg Ser Pro Arg Ala 
290 295 300 



Leu Trp Arg Pro Pro Arg Arg Arg Asn Thr Asp Pro Arg Arg Leu Val 

305 310 315 ~ ^ 320 

Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe Val 

325 330 ' 335 
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Arg Ala Cys Leu Arg Arg Leu Val 
340 

His Asn Glu Arg Arg Phe Leu Arg 
355 360 

Gly Lys His Ala Lys Leu Ser Leu 
370 375 

Val Arg Asp Cys Ala Trp Leu Arg 
385 390 

Pro Ala Ala Glu His Arg Leu Arg 
405 

His Trp Leu Met Ser Val Tyr Val 
420 

Tyr Val Thr Glu Thr Thr Phe Gin 
435 440 

Lys Ser Val Trp Ser Lys Leu Gin 
450 455 



Pro Pro Gly Leu Trp Gly Ser Arg 
345 350 

Asn Thr Lys Lys Phe lie Ser Leu 
365 

Gin Glu Leu Thr Trp Lys Met Ser 
380 

Arg Ser Pro Gly Val Gly Cys Val 
395 ** ' 400 

Glu Glu lie Leu Ala Lys Phe Leu 
410 " 415 

Val Glu Leu Leu Arg Ser Phe Phe 
425 430 

Lys Asn Arg Leu Phe Phe Tyr Arg 
445 

Ser He Gly He Arg Gin His Leu 
460 



Lys Arg Val Gin Leu Arg Glu Leu 
465 470 

Arg Glu Ala Arg Pro Ala Leu Leu 
485 

Lys Pro Asp Gly Leu Arg Pro He 
500 

Ala Arg Thr Phe Arg Arg Glu Lys 
515 520 



Ser Glu Ala Glu Val Arg Gin His 
475 ~ 480 

Thr Ser Arg Leu Arg Phe He Pro 
490 495 

Val Asn Met Asp Tyr Val Val Gly 
505 510 

Arg Ala Glu Arg Leu Thr Ser Arg 
525 



Val Lys Ala Leu Phe Ser Val Leu 
530 535 



Asn Tyr Glu Arg Ala Arg Arg Pro 
540 



Gly Leu Leu Gly Ala Ser Val Leu 
545 550 

Trp Arg Thr Phe Val Leu Arg Val 
565 

Leu Tyr Phe Val Lys Val Asp Val 
580 

Gin Asp Arg Leu Thr Glu Val He 
595 600 

Thr Tyr Cys Val Arg Arg Tyr Ala 
610 615 



Gly Leu Asp Asp He His Arg Ala 
555 560 

Arg Ala Gin Asp Pro Pro Pro Glu 
570 575 

Thr Gly Ala Tyr Asp Thr He Pro 
585 590 

Ala Ser He He Lys Pro Gin Asn 
605 

Val Val Gin Lys Ala Ala His Gly 
620 



His Val Arg Lys Ala Phe Lys Ser 
625 630 

Gin Pro Tyr Met Arg Gin Phe Val 
645 



His Val Ser Thr Leu Thr Asp Leu 

635 640 

Ala His Leu Gin Xaa Asn Ser Pro 
650 655 



Leu Arg Asp Ala Val Val He Glu 
660 



Gin Ser Ser Ser Leu Asn Glu Ala 
665 670 
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Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala 
675 680 685 

Val Arg He Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly He Pro Gin 
690 695 700 

Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met 
705 710 715 720 

Glu Asn Lys Leu Phe Ala Gly He Arg Arg Asp Gly Leu Leu Leu Arg 
725 730 " " 735 

Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys 
740 745 750 

Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val 
755 760 765 

Val A3n Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala 
770 775 780 

Leu Gly Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe Pro 
785 790 795 " 800 

Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser Asp 
805 810 815 

Tyr Ser Ser Tyr Ala Arg Thr Ser He Arg Ala Ser Leu Thr Phe Asn 
820 825 830 

Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val 
835 840 845 

Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn Ser 
850 855 860 

Leu Gin Thr Val Cys Thr Asn He Tyr Lys He Leu Leu Leu Gin Ala 
865 870 875 880 

Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin Val 
885 890 895 

Tip Lys Asn Pro Thr Phe Phe Leu Arg Val He Ser Asp Thr Ala Ser 
900 905 910 

Leu Cys Tyr Ser He Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly 
915 920 925 

Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp Leu 
930 935 940 

Cys His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr 
945 950 955 " 960 

Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser Arg 
965 970 975 

Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro 
980 985 990 

Ala Leu Pro Ser Asp Phe Lys Thr He Leu Asp 
995 " 1000 
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(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SE0 ID NO: 218: 

Trp Pro Pro Ala His Ser Gin Ala Glu Ser Arg His Gin Gin Pro Cys 
1 5 10 15 

His Ala Gly Leu Tyr Val Pro Gly Arg Glu Gly Arg Pro Thr Pro Arg 
20 25 ^ 30 

Pro Ma Pro Leu Gly Val 
35 

u (2) INFORMATION FOR SEQ ID NO: 219: 

y (i) SEQUENCE CHARACTERISTICS: 
CI (A) LENGTH: 13 amino acids 

Ul (B) TYPE: amino acid 

p (C) STRANDEDNESS : 

;3J. (D) TOPOLOGY: linear 

y * 

h* (ii) MOLECULE TYPE: peptide 

L (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

Gly Leu Ser Glu Cys Leu Ala Glu Ala Cys Met Ser Gly 

ft* . 1 5 10 

W (2) INFORMATION FOR SEQ ID NO: 220: 

ru 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

Arg Leu Ser Val Arg Leu Arg Pro Glu Arg Val Ser Ser Gin Gly Leu 
15 10 15 

Ser Val Gin His Thr Cys Arg Leu His Phe Pro Thr Gly Trp Arg Ser 
20 25 30 

Ala Pro Pro Gin Gly Gin Leu Phe Leu Thr Arg Ser Pro Ala Ser Thr 
35 40 45 
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Pro His He Gly He Val His Pro Gin He Arg His Cys Ser Pro Leu 
50 55 60 

Ala Leu Pro Ser Phe Ala Phe His Pro His His Pro Gly Gly Asp Pro 
65 70 75 " * 80 

Glu Lys Asp Pro Gly Ser Ser Gly Asn Leu Glu 
85 90 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

Pro Lys Val Cys Pro Val His Arg Arg Gly Pro Cys Thr Trp Met Gly 
15 10 15 



Val Pro Val Gly Gin He Gly Gly Arg Cys Cys Gly Ser Lys He Leu 
0 20 25 * 30 



Asn He 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
U (C) STRANDEDNESS: 

j»* (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 

Val Phe Gin Phe 
1 



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 
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Lys Lys Lys Lys Lys Lys Lys Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4015 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 56.. 3454 

(D) OTHER INFORMATION : /product^ "hTRT" 

/note= "human telomerase reverse 
transcriptase (hTRT) catalytic protein 
component" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:224: 

GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC CCGCG ATG 58 

Met 
1 

CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC CAC 106 
Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser His 
5 10 15 

TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG GGG CCC 154 
Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly Pro 
20 25 ' 30 

CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT TTC CGC GCG 202 
Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg Ala 
35 40 45 

CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC GCA CGG CCG CCC 250 
Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro Pro 
50 55 60 65 

CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC CTG AAG GAG CTG GTG 298 
Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu Val 
70 75 80 

GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC GGC GCG AAG AAC GTG CTG 346 
Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val Leu 
85 90 95 

GCC TTC GGC TTC GCG CTG CTG GAC GGG GCC CGC GGG GGC CCC CCC GAG 394 
Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro Glu 
100 105 110 

GCC TTC ACC ACC AGC GTG CGC AGC TAC CTG CCC AAC ACG GTG ACC GAC 442 
Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr Asp 
115 120 125 
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GCA CTG CGG GGG AGC GGG GCG TGG GGG CTG CTG CTG CGC CGC GTG GGC 490 
Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val Gly 
130 135 140 145 

GAG GAC GTG CTG GTT CAC CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG 538 
Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val Leu 
150 155 160 

GTG GCT CCC AGC TGC GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG 586 
Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr Gin 
165 170 * 175 

CTC GGC GCT GCC ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC 634 
Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly Pro 
180 185 190 

CGA AGG CGT CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG 682 
Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg Glu 
195 200 205 

GCC GGG GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG 730 
Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg Gly 
210 215 220 225 

V* GGC AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 778 

P Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg Gly 
P 230 235 240 

U1 GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG GCC 826 
£i Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp Ala 
m 245 250 255 



fa 



CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT GTG GTG 874 
His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val 
260 265 270 

TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG GGT GCG CTC 922 
Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu 
275 280 285 



Q TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC CAG CAC CAC GCG 970 
m Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His Ala 
W 290 295 300 305 

GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC TGG GAC ACG CCT TGT 1018 
Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys 
310 315 320 

CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC CTC TAC TCC TCA GGC GAC 1066 
Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp 
325 330 335 

AAG GAG CAG CTG CGG CCC TCC TTC CTA CTC AGC TCT CTG AGG CCC AGC 1114 
Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser 
340 345 350 

CTG ACT GGC GCT CGG AGG CTC GTG GAG ACC ATC TTT CTG GGT TCC AGG 1162 
Leu Thr Gly Ala Arg Arg Leu Val Glu Thr lie Phe Leu Gly Ser Arg 
355 360 365 

CCC TGG ATG CCA GGG ACT CCC CGC AGG TTG CCC CGC CTG CCC CAG CGC 1210 
Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin Arg 
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370 



375 



380 



385 



TAC TGG CAA ATG CGG CCC CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG 1258 
Tyr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala 
390 395 400 

CAG TGC CCC TAC GGG GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT 1306 
Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala 
405 410 415 

GCG GTC ACC CCA GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC 1354 
Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin Gly 
420 425 " 430 

TCT GTG GCG GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG 1402 
Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val 
435 440 " " 445 

CAG CTG CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG 1450 
Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe Val 
450 455 460 465 

CGG GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 1498 
Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg 
470 475 480 

□ CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC CTG 1546 

His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe lie Ser Leu 
t% 485 490 495 

m GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG ATG AGC 1594 
pi Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met Ser 
* 500 505 510 

f ' h GTG CGG GA^ TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT GGC TGT GTT 1642 

^ : Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val 
0 515 520 525 

h4 

L ft CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG GCC AAG TTC CTG 1690 

y Pro Ala Ala Glu His Arg Leu Arg Glu Glu lie Leu Ala Lys Phe Leu 

W 530 535 540 545 
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CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG CTC AGG TCT TTC TTT 1738 
His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe 
550 555 560 

TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG CTC TTT TTC TAC CC3G 1786 
Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg 
565 570 575 

AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT GGA ATC AGA CAG CAC TTG 1834 
Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly lie Arg Gin His Leu 
580 * 585 590 

AAG AGG GTG CAG CTG CGG GAG CTG TCG GAA GCA GAG GTC AGG CAG CAT 1882 
Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His 
595 600 605 

CGG GAA GCC AGG CCC GCC CTG CTG ACG TCC AGA CTC CGC TTC ATC CCC 1930 
Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro 
610 615 620 625 
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AAG CCT GAC GGG CTG CGG CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA 1978 
Lys Pro Asp Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val Gly 
630 635 640 

GCC AGA ACG TTC CGC AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG 2026 
Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg 
645 650 655 

GTG AAG GCA CTG TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC 2074 
Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro 
660 665 670 

GGC CTC CTG GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC 2122 
Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp He His Arg Ala 
675 680 685 

TGG CGC ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG 2170 
Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro Glu 
690 695 700 705 

CTG TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 2218 
Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr He Pro 
710 715 720 

. , CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 2266 

r* Gin Asp Arg Leu Thr Glu Val He Ala Ser He He Lys Pro Gin Asn 
Q 725 730 735 



4* 

m 



ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC CAT GGG 2314 
Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His Gly 
740 " 745 750 



\& CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG ACA GAC CTC 2362 

L| His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu 
755 " 760 765 

0 CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG GAG ACC AGC CCG 2410 

M Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin Glu Thr Ser Pro 
^ 770 775 780 785 

jj?'- CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC TCC CTG AAT GAG GCC 2458 
0 Leu Arg Asp Ala Val Val He Glu Gin Ser Ser Ser Leu Asn Glu Ala 
fjj 790 795 800 

AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC TTC ATG TGC CAC CAC GCC 2506 
Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala 
805 810 815 

GTG CGC ATC AGG GGC AAG TCC TAC GTC CAG TGC CAG GGG ATC CCG CAG 2554 
Val Arg He Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly He Pro Gin 
820 ~ * 825 830 

GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC CTG TGC TAC GGC GAC ATG 2602 
Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met 
835 840 845 

GAG AAC AAG CTG TTT GCG GGG ATT CGG CGG GAC GGG CTG CTC CTG CGT 2650 
Glu Asn Lys Leu Phe Ala Gly He Arg Arg Asp Gly Leu Leu Leu Arg 
850 ' 855 860 865 

TTG GTG GAT GAT TTC TTG TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA 2698 
Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys 
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870 



875 



880 





ACC 
Thr 


TTC CTC AGG ACC 
Phe Leu Arg Thr 

o o c 

885 


CTG 
Leu 


GTC 
Val 


CGA GGT 
Arg Gly 
890 


GTC 
Val 


CCT 
Pro 


GAG 
Glu 


TAT GGC 
Tyr Gly 

one 

895 


TGC 
Cys 


GTG 
Val 




GTG 
Val 


AAC TTG CGG AAG 
Asn Leu Arg Lys 
900 


ACA 
Thr 


GTG 
Val 


GTG 
Val 
905 


AAC 
Asn 


TTC 
Phe 


CCT 
Pro 


GTA 
Val 


GAA GAC 
Glu Asp 
910 


GAG 
Glu 


GCC 
Ala 




CTG 
Leu 


GGT GGC ACG GCT 
Gly Gly Thr Ala 
915 


TTT 
Phe 


GTT 
Val 
920 


CAG 
Gin 


ATG 
Met 


CCG 
Pro 


GCC 
Ala 


CAC 
His 
925 


GGC CTA 
Gly Leu 


TTC 
Phe 


CCC 
Pro 




TGG 
Trp 
930 


TGC GGC CTG CTG 
Cys Gly Leu Leu 


CTG 
Leu 
935 


GAT 
Asp 


ACC CGG 
Thr Arg 


ACC 
Thr 


CTG 
Leu 
940 


GAG 
Glu 


GTG CAG 
Val Gin 


AGC 
Ser 


GAC 
Asp 
945 




TAC 
Tyr 


TCC AGC TAT GCC 
Ser Ser Tyr Ala 
950 


CGG 
Arg 


ACC 
Thr 


TCC 
Ser 


ATC 
He 


AGA 
Arg 
955 


GCC 
Ala 


AGT 
Ser 


CTC ACC 
Leu Thr 


TTC 
Phe 
960 


AAC 
Asn 




CGC 
Arg 


GGC TTC AAG GCT 
Gly Phe Lys Ala 
965 


GGG 
Gly 


AGG 
Arg 


AAC 
Asn 


ATG 
Met 
970 


CGT 
Arg 


CGC 
Arg 


AAA 
Lys 


CTC TTT 
Leu Phe 
975 


GGG 
Gly 


GTC 
Val 


a 

D 

m 


TTG 
Leu 


CGG CTG AAG TGT 
Arg Leu Lys Cys 
980 


CAC 
His 


AGC 
Ser 


CTG 
Leu 
985 


TTT 
Phe 


CTG 
Leu 


GAT 
Asp 


TTG 
Leu 


CAG GTG 
Gin Val 
990 


AAC 
Asn 


AGC 
Ser 


m 


CTC 
Leu 


CAG ACG GTG TGC 
Gin Thr Val Cys 
995 


ACC 
Thr 


AAC ATC TAC 
Asn lie Tyr 
1000 


AAG 
Lys 


ATC 
He 


CTC CTG CTG 
Leu Leu Leu 
1005 


CAG 
Gin 


GCG 
Ala 




TAC AGG TTT CAC GCA 
Tyr Arg Phe His Ala 
1010 


TGT GTG 
Cys Val 
1015 


CTG 
Leu 


CAG 
Gin 


CTC 
Leu 


CCA TTT 
Pro Phe 
1020 


CAT CAG 
His Gin 


CAA 
Gin 


GTT 
Val 
1025 


D3 


TGG 
Trp 


AAG AAC CCC ACA TTT 
Lys Asn Pro Thr Phe 
1030 


TTC 

Phe 


CTG CGC 
Leu Arg 


GTC ATC 
Val He 
1035 


TCT 
Ser 


GAC ACG 
Asp Thr 


GCC TCC 
Ala Ser 
1040 


n 


CTC 
Leu 


TGC TAC TCC ATC 
Cys Tyr Ser lie 
1045 


CTG 
Leu 


AAA 
Lys 


GCC 
Ala 


AAG AAC 
Lys Asn 
1050 


GCA 
Ala 


GGG 
Gly 


ATG TCG CTG 
Met Ser Leu 
1055 


GGG 
Gly 




GCC 
Ala 


AAG GGC GCC GCC 
Lys Gly Ala Ala 
1060 


GGC 
Gly 


CCT 
Pro 


CTG CCC 
Leu Pro 
1065 


TCC 
Ser 


GAG 
Glu 


GCC 
Ala 


GTG CAG 
Val Gin 
1070 


TGG 
Trp 


CTG 
Leu 




TGC 
Cys 


CAC CAA GCA TTC 
His Gin Ala Phe 
1075 


CTG 
Leu 


CTC AAG 
Leu Lys 
1080 


CTG 
Leu 


ACT 
Thr 


CGA 
Arg 


CAC CGT GTC 
His Arg Val 
1085 


ACC 
Thr 


TAC 
Tyr 




GTG CCA CTC CTG GGG 
Val Pro Leu Leu Gly 
1090 


TCA CTC AGG 
Ser Leu Arg 
1095 


ACA 
Thr 


GCC 
Ala 


CAG ACG 
Gin Thr 
1100 


CAG CTG 
Gin Leu 


AGT 
Ser 


CGG 
Arg 
1105 




AAG 
Lys 


CTC CCG GGG ACG ACG 
Leu Pro Gly Thr Thr 
1110 


CTG 
Leu 


ACT 
Thr 


GCC 
Ala 


CTG GAG GCC 
Leu Glu Ala 
1115 


GCA GCC 
Ala Ala 


AAC CCG 
Asn Pro 
1120 



2746 



2794 



2842 



2890 



2938 



2986 



3034 



3082 



3130 



3178 



3226 



3274 



3322 



3370 



3418 
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•GCA CTG CCC TCA GAC TTC AAG ACC ATC CTG GAC TGATGGCCAC CCGCCCACAG 3471 
Ala Leu Pro Ser Asp Phe Lys Thr lie Leu Asp 
1125 1130 

CCAGGCCGAG AGCAGACACC AGCAGCCCTG TCACGCCGGG CTCTACGTCC CAGGGAGGGA 3531 

GGGGCGGCCC ACACCCAGGC CCGCACCGCT GGGAGTCTGA GGCCTGAGTG AGTGTTTGGC 3591 

CGAGGCCTGC ATGTCCGGCT GAAGGCTGAG TGTCCGGCTG AGGCCTGAGC GAGTGTCCAG 3651 

CCAAGGGCTG AGTGTCCAGC ACACCTGCCG TCTTCACTTC CCCACAGGCT GGCGCTCGGC 3711 

TCCACCCCAG GGCCAGCTTT TCCTCACCAG GAGCCCGGCT TCCACTCCCC ACATAGGAAT 3771 

AGTCCATCCC CAGATTCGCC ATTGTTCACC CCTCGCCCTG CCCTCCTTTG CCTTCCACCC 3831 

CCACCATCCA GGTGGAGACC CTGAGAAGGA CCCTGGGAGC TCTGGGAATT TGGAGTGACC 3891 

AAAGGTGTGC CCTGTACACA GGCGAGGACC CTGCACCTGG ATGGGGGTCC CTGTGGGTCA 3951 

AATTGGGGGG AGGTGCTGTG GGAGTAAAAT ACTGAATATA TGAGTTTTTC AGTTTTGAAA 4011 

AAAA 4015 

h* (2) INFORMATION FOR SEQ ID NO: 225: 

D 

Q (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1132 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:225: 

Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser 
15 10 15 

His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly 
20 25 30 



in 



J Pro Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg 
35 40 45 

Ala Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro 
50 * 55 60 

Pro Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu 
65 70 75 80 

Val Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val 
85 90 ' ' * 95 

Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro 
100 105 110 

Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr 
115 120 ~ 125 

Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val 
130 * 135 ~ 140 
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Gly Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val 
145 150 155 ~ 160 

Leu Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr 
165 170 " 175 

Gin Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly 
180 185 190 

Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg 
195 200 205 

Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg 
210 215 220 

Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg 
225 230 235 * 240 

Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp 
245 250 255 

Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val 
260 265 270 

Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala 
275 280 285 

Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His 
290 295 300 

Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro 
305 310 315 320 

Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly 
325 330 * 335 

Asp Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro 
340 345 350 

Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr lie Phe Leu Gly Ser 
355 360 365 

Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin 
370 375 380 

Arg Tyr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His 
385 390 395 400 

Ala Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg 
405 410 " 415 

Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin 
420 425 430 

Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu 
435 440 " 445 

Val Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe 
450 455 ' 460 



Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser 
465 470 475 480 
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Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe He Ser 
485 490 495 

Leu Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met 
500 505 510 

Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys 
515 520 525 

Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu He Leu Ala Lys Phe 
530 535 540 

Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe 
545 " 550 555 560 

Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr 
565 570 575 

Arg Lys Ser Val Trp Ser Lys Leu Gin Ser He Gly He Arg Gin His 
580 585 590 

Leu Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin 
595 600 605 

His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe He 
610 615 620 

Pro Lys Pro Asp Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val 
625 630 635 640 

Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser 
645 650 655 

Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg 
660 665 670 

Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp He His Arg 
675 * 680 685 

Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro 
690 695 700 

Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr He 
705 710 715 720 

Pro Gin Asp Arg Leu Thr Glu Val He Ala Ser He He Lys Pro Gin 
725 730 735 

Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His 
740 745 750 

Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp 
755 760 765 

Leu Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin Glu Thr Ser 
770 775 780 

Pro Leu Arg Asp Ala Val Val He Glu Gin Ser Ser Ser Leu Asn Glu 
785 790 795 800 



Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His 
805 810 815 
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Ala Val Arg He Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly lie Pro 
820 825 830 

Gin Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp 
835 840 845 

Met Glu Asn Lys Leu Phe Ala Gly He Arg Arg Asp Gly Leu Leu Leu 
850 855 860 

Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala 
865 870 875 880 

Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys 
885 890 * 895 

Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu 
900 905 910 

Ala Leu Gly Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe 
915 920 925 

Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser 
930 935 940 

Asp Tyr Ser Ser Tyr Ala Arg Thr Ser He Arg Ala Ser Leu Thr Phe 
945 950 955 960 

Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly 
965 ~ 970 * *" 975 

Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn 
980 985 990 

Ser Leu Gin Thr Val Cys Thr Asn He Tyr Lys He Leu Leu Leu Gin 
995 1000 1005 

Ala Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin 
1010 1015 1020 

Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val He Ser Asp Thr Ala 
1025 " " 1030 1035 1040 

Ser Leu Cys Tyr Ser He Leu Lys Ala Lys Asn Ala Gly Met Ser Leu 
1045 1050 1055 

Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp 
1060 1065 1070 

Leu Cys His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr 
1075 * 1080 1085 

Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser 
1090 1095 ~ 1100 

Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn 
1105 1110 1115 1120 

Pro Ala Leu Pro Ser Asp Phe Lys Thr He Leu Asp 
1125 1130 
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